Transduction of recombinases for inducible gene targeting

The present invention provides the use of a fusion protein comprising a 
site-specific DNA recombinase domain and a protein transduction domain 
for preparing an agent for inducing target gene alteration in a living 
organism or in cultured cells, suitable fusion proteins and a method for the 
production of said fusion proteins. 

Background 

For some years targeted mutagenesis in totipotent mouse embryonic stem 
(ES) cells has been used to inactivate genes, for which cloned sequences 
were available (Capecchi, Trends in Genetics 5, 70 - 76 (1989)). Since ES 
cells can pass mutations induced in vitro to transgenic offspring in vivo, it 
is possible to analyze the consequences of gene disruption in the context 
of the entire organism. Thus, numerous mouse strains with functionally 
inactivated genes ("knock out mice") have been created by this 
technology and utilized to study the biological function of a variety of 
genes. 

A refined method of targeted mutagenesis, referred to as conditional 
mutagenesis, employs a site-specific recombination system (e.g. Cre/loxP 
or Flp/frt - Sauer and Henderson, Proc. Natl. Acad. Sci. USA 85,
5166-5170 (1988); Senecoff et al., J. Mol. Biol., 201, 405 - 421 (1988)) which
enables a temporally and/or spatially restricted alteration of target genes
(Rajewsky et al., J. Clin. Invest., 98, 600 - 603 (1996)). The creation of
conditional mouse mutants requires the generation of two mouse strains, 
i.e. the recombinase recognition strain and the recombinase expressing 
strain. The recombinase recognition strain is generated by homologous 
recombination in ES cells as described above except that the targeted 
exon(s) is (are) flanked by two recombinase recognition sequences
(hereinafter "RRS"; e.g. loxP or frt). The type of recombination event
mediated by the recombinase depends on the disposition of the RRS, with 
deletions, inversions, translocations and integrations being possible 
(Torres and Kuhn, Oxford University Press, Oxford, New York (1997)). By 
placing the RRS into introns, an interference with gene expression before 
recombination can be avoided. The recombinase expressing strain 
contains a recombinase transgene (e.g. Cre, Flp) whose expression is 
either restricted to certain cells and tissues or is inducible by external 
agents. Crossing of the recombinase recognition strain with the
recombinase expressing strain results in recombination of the RRS-flanked
exons in the doubly transgenic offspring in a prespecified temporally and/or
spatially restricted manner. Thus, the method allows the temporal analysis
of gene function in particular cells and tissues of otherwise widely 
expressed genes. Moreover, it enables the analysis of gene function in the 
adult organism by circumventing embryonic lethality which is frequently 
the consequence of gene mutation. For pharmaceutical research, aiming 
to validate the utility of genes and their products as targets for drug 
development, inducible mutations provide an excellent genetic tool. 
However, the current systems for inducible recombinase expression in
transgenic animals suffer from a certain degree of leakiness in the
absence of the inducer (Kuhn et al., Science 269(5229): 1427-9 (1995);
Schwenk et al., Nucleic Acids Res. 26(6): 1427-32 (1998)). Furthermore,
the generation of conditional mutants is a time-consuming and
labor-intensive procedure, since the recombinase recognition strain and the
recombinase expressing strain have to be bred over at least two
generations in order to obtain animals carrying both the recombinase
transgene and two copies of the RRS-flanked target gene sequence.

Protein transduction domains (hereinafter referred to as "PTD") that
have the ability to cross cell membranes were identified, e.g. in the
Antennapedia protein from Drosophila (Vives et al., J. Biol. Chem.
272(25): 16010-7 (1997)), Kaposi fibroblast growth factor (Kaposi FGF;
Lin et al., J. Biol. Chem. 270: 14255-58 (1995)), VP22 from HSV (Elliott
and O'Hare, Cell, 88(2):223-33 (1997)) and TAT from HIV (Green and
Loewenstein, Cell, 55(6): 1179-88 (1988); Frankel and Pabo, Cell,
55(6):1189-93 (1988)). WO 99/29721 moreover mentions TAT mutants
having an enhanced activity as compared to the wild-type peptide.
Fusion of PTDs to heterologous proteins conferred the ability to transduce
into cultured cells (Fawell et al., Proc. Natl. Acad. Sci. USA, 91(2):664-8
(1994); Elliott and O'Hare (1997); Phelan et al., Nature Biotech. 16:
440-443 (1998) and Dilber et al., Gene Ther., 6(1):12-21 (1999)). Dalby and
Bennett showed that a fusion protein consisting of VP22 and functional Flp
recombinase translocated between cells in culture (from COS-1 cells
transfected with VP22-Flp to CHO cells carrying Flp recognition sites (FRT
sites); see Dalby and Bennett, Invitrogen, Expressions 6.2, page 13
(1999)). Further, WO 99/11809 mentions a fusion protein Antp-Cre and
emphasizes that it may be used to deliver Cre into the cell, where it
recombines inside the cell nucleus. It is mentioned that the fusion protein
is suitable for manipulating genomic DNA at precise locations in a
temporally regulated manner.

Furthermore, a recent report demonstrated that the β-galactosidase
protein fused to the 11 amino acid PTD from the HIV TAT protein can
infiltrate all tissues of living mice, reaching every single cell (Schwarze et
al., Science, 285(5433): 1569-72 (1999)). Finally, WO 99/60142 discloses
vector constructs for gene therapy carrying a tumor cell sensitizing gene,
a sensitizing gene expression regulatory system, a control gene and a
control gene expression regulatory system, wherein the control gene can
be a fusion gene consisting of a recombinase (viz. Cre or Flp) and a
trafficking protein (viz. VP22).

With regard to the fusion protein Antp-Cre of WO 99/11809, it is, however,
general knowledge in the art that the Antennapedia PTD is not a generally
applicable transducing protein; namely, it has only a limited activity with
proteins having more than 100 amino acid residues (Derossi et al., Trends
Cell Biol. 8: 84-87, 1998). In view of the limited transducing activity of the
Antp PTD and the size of the generally known recombinases (ranging from
about 200 to about 600 amino acid residues), it was desirable to provide a
more potent system for the transduction of recombinases. It was,
however, not clear for a person skilled in the art whether PTDs would be
effective at all with recombinases for the following reasons:

(i) only a single example of PTD-mediated delivery of proteins (above 100
amino acid residues) in vivo has been reported so far (Schwarze et al.,
Science, 285(5433): 1569-72 (1999); Fawell et al., PNAS, 91: 664-68
(1994); both references describing the TAT-mediated transduction of
β-galactosidase in mice);

(ii) it is known that - due to unfolding and refolding processes - the
transduction of native proteins into cells may result in a significant loss of
protein activity (e.g., as described for TAT-GFP; Schwarze et al., Trends
Cell Biol. 10: 290-95 (2000));

(iii) neither the number of protein molecules that can be transferred into a 
cell by a given translocation domain has been systematically determined, 
nor the number of Cre molecules in the cell nucleus that is required for 
efficient recombination; 

(iv) the delivery of active proteins requires unfolding and proper refolding,
which is unpredictable for a given protein (Bonifaci et al., AIDS 9:
995-1000 (1995)); and

(v) the mechanism by which protein transduction domains facilitate
protein transduction is unknown, and several findings have been published
that rule out classical receptor-, transporter-, endosome- or endocytosis-
mediated processes in the transduction of Antp, TAT and VP22 (G. Elliott, P.
O'Hare, Cell 88, 223-233 (1997); D.A. Mann, A.D. Frankel, EMBO J. 10,
1733-1739 (1991); D. Derossi et al., J. Biol. Chem. 269, 10444-10450
(1994); D. Derossi et al., J. Biol. Chem. 271, 18188-18193 (1996); E.
Vives et al., J. Biol. Chem. 272, 16010-16017 (1997)).

WO 01/49832 PCT/EP01/00060

Moreover, there was still the need for a generally applicable method where
the genetic manipulation can be performed in both endogenous genes
and transgenes.

Summary of the Invention 

It was found that site-specific DNA recombinase proteins can be 
translocated into cells of a living organism when fused to specific protein 
transduction domains, namely transduction domains being derived from 
the VP22 protein of HSV or from the TAT protein of HIV. Thus, whenever a 
gene mutation is desired, recombination is induced upon the injection of 
the appropriate site-specific recombinase fused to a transduction domain 
into such a living organism (provided, however, that said organism carries 
at least one appropriate RRS integrated in the genome). 

The present invention thus provides 

(1) the use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one
or more recognition sites for said site-specific DNA recombinase integrated 
in its genome; 

(2) a method for inducing gene alterations in a living organism which
comprises administering to said living organism a fusion protein 
comprising a site-specific DNA recombinase domain and a PTD as defined 
in (1) above, wherein said living organism carries at least one or more 
recognition sites for said site-specific DNA recombinase integrated in its
genome; 

(3) a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a PTD being derived from the VP22 protein of HSV or from the TAT 
protein of HIV 

provided that when the site-specific DNA recombinase domain is wild-type 
Cre or Flp then the PTD is not the full length VP22 PTD of HSV (i.e., the 
fusion protein is not identical to the fusion protein of Dalby and Bennett, 
Invitrogen, Expressions 6.2, page 13 (1999) and of WO 99/60142); 

(4) a DNA sequence coding for the fusion protein of (3) above; 

(5) a vector comprising the DNA sequence as defined in (4) above; 

(6) a host cell transformed with the vector of (5) above and/or comprising 
the DNA of (4) above; 

(7) a method for producing the fusion protein of (1) above which 
comprises culturing the transformed host cell of (6) above and isolating 
the fusion protein; and 

(8) an injectable composition comprising the fusion protein as defined in 
(1) or (3) above. 

The invention is further illustrated by the appended Figures and is 
explained in detail below. 

Description of the Figures 

Fig. 1: Generation of induced mouse mutants using purified fusion 
proteins. 

A: Expression of the fusion protein consisting of the site-specific DNA 
recombinase (e.g. Cre) and the protein transduction domain (e.g. the HIV 
derived TAT peptide) in prokaryotic or eukaryotic cells. 
B: Extraction and purification of the expressed fusion protein (e.g. as 
described in Nagahara et al., Nat. Med. 4 (12):1449-52 (1998)). 
C: Injection of the purified fusion protein into mice carrying the RRS-
flanked target sequence. 

D: Analysis of the pattern of induced target gene recombination and the 
resulting phenotype. 
Triangle: RRS. 

Fig. 2: Scheme of the bacterial expression vector pT7-TACS (SEQ ID
NO:16). The coding region of the 11 amino acid protein transduction 
domain of HIV TAT protein is fused to the N-terminus of the Cre 
recombinase protein sequence. The 10-amino-acid strep tag and the 
protease factor Xa recognition sequence are fused to the C-terminus. The 
T7 promoter permits expression of TAT-Cre protein in E. coli. 

Fig. 3: Detection of purified TAT-Cre protein by Coomassie staining and
Western blot analysis.

A: Coomassie stained SDS-PAGE gel. Lane 1: 10 kDa ladder (Life
Technologies, Cat. No.: 10064-012), 2: 1000 ng BSA, 3: 750 ng BSA, 4:
500 ng BSA, 5: 100 ng BSA, 6: 50 ng BSA, 7: 5 µl TAT-Cre, 8: 1 µl
TAT-Cre in Bicine buffer.

B: Western blot analysis using an alkaline phosphatase-conjugated
anti-strep tag antibody (IBA, Cat. No: 2-1503-001). Lane 1: MultiMark
(Invitrogen, Cat. No.: LC5725), 2: 7 µl TAT-Cre, 3: 5 µl TAT-Cre, 4: 2,5 µl
TAT-Cre, 5: 1,25 µl TAT-Cre in Bicine buffer.

Fig. 4: X-Gal staining of M5Pax8 cells treated with TAT-Cre protein.
M5Pax8 fibroblasts were treated for 18 h with 3,5 (A), 6,9 (B) and 13,8
µg/ml TAT-Cre protein (C) in serum-free medium. Four days after
treatment, cells were fixed and stained with X-Gal.

Fig. 5: Measurement of β-galactosidase activity in cell lysates. M5Pax8
fibroblasts were treated for 18 h with increasing concentrations of
TAT-Cre, as indicated, or transiently transfected with expression vectors
for either Cre (pCMV-I-Cre-pA, see SEQ ID NO:29) or β-galactosidase
(pCMV-I-β-pA, see SEQ ID NO:30). Four days after treatment, cells were
lysed and the β-galactosidase activities were determined.

Fig. 6: PCR detection of TAT-Cre mediated recombination in mice.
A: PCR-analysis of genomic DNA from duodenum (lane 2), liver (3),
kidney (4), spleen (5), muscle (6), lung (7), tail (8) and brain (9) of a
plnl3 mouse treated three times with intraperitoneal injections of 75 µg
TAT-Cre protein at two-day intervals. Deletion of the loxP-flanked DNA
segment is indicated by the presence of the about 400 bp fragment. Lane
1: 1 kb ladder (Life Technologies).

B: PCR strategy to detect Cre-mediated deletion of the loxP-flanked DNA
segment. Arrows indicate the positions of the primers.
C: PCR-analysis of genomic DNA from spleen of a plnl3 mouse treated
three times with intraperitoneal injections of 75 µg TAT-Cre protein at
two-day intervals (lane 4). To confirm the presence of the BamH I restriction
site, the PCR product was digested with BamH I which produces two
diagnostic fragments of about 190 and about 210 bp (5). As a control, tail
DNA from untreated mice carrying the loxP-flanked (lane 2) and the
deleted plnl3 allele (3) was subjected to PCR amplification. Lane 1: 100
bp ladder (Life Technologies), lane 6: 1 kb ladder (Life Technologies).

Fig. 7: Scheme of the bacterial expression vectors pT7-VPCS (SEQ ID
NO:17) and pCRT7-ΔVPCS (SEQ ID NO:15). The coding region of the 301
amino acid protein transduction domain of HSV VP22 protein (A) or the
truncated 143 amino acid ΔVP22 domain (B) is fused to the N-terminus of
the Cre recombinase protein sequence. The 10-amino-acid strep tag and
the protease factor Xa recognition sequence are fused to the C-terminus.
The T7 promoter allows the expression of VP22-Cre and ΔVP22-Cre fusion
proteins in E. coli. The sequence in pCRT7-ΔVPCS encoding the 15 amino
acid N-terminal leader sequence is used for enhanced protein stability
(Invitrogen).

Fig. 8: Detection of the purified VP22-Cre and ΔVP22-Cre fusion proteins
by Coomassie staining and Western blot analysis.
A: Detection of VP22-Cre protein in a Coomassie-stained SDS-PAGE gel. 
Lane 1: 10 kDa ladder, 2: 1000 ng BSA, 3: 500 ng BSA, 4: 100 ng BSA, 
5: inclusion body protein extract before chromatography, 6: unbound 
protein, 7: fraction 17, 8: fraction 18, 9: fraction 19, 10: fraction 20. The 
position of the 75 kDa VP22-Cre protein is indicated by the arrow head. 
B: Detection of VP22-Cre protein by Western blot analysis using an 
alkaline phosphatase-conjugated anti-strep tag antibody (IBA, Cat. No.: 2- 
1503-001). Lane 1: MultiMark (Invitrogen), 2: inclusion body protein 
extract before chromatography, 3: unbound protein, 4: fraction 10, 5: 
fraction 11, 5: fraction 16, 6: fraction 17, 7: fraction 18, 8: fraction 19, 9: 
fraction 19, 10: fraction 20. 

C: Detection of ΔVP22-Cre protein in a Coomassie-stained SDS-PAGE gel.
Lane 1: 10 kDa ladder, 2: inclusion body protein extract before
chromatography, 3: unbound protein, 4: fraction 1, 5: fraction 8, 6:
fraction 9, 7: fraction 15, 8: 100 ng BSA, 9: 500 ng BSA, 10: 1000 ng
BSA. The position of the 60 kDa ΔVP22-Cre protein is indicated by the
arrow head.

D: Detection of ΔVP22-Cre protein by Western blot analysis using an
alkaline phosphatase-conjugated anti-strep tag antibody (IBA, Cat. No.:
2-1503-001). Lane 1: MultiMark (Invitrogen), 2: inclusion body protein
extract before chromatography, 3: unbound protein, 4: fraction 4, 5: 
fraction 8, 6: fraction 10, 7: fraction 12, 8: soluble protein extract before 
chromatography, 9: unbound protein, 10: fraction 7. 



Fig. 9: X-Gal staining of M5Pax8 cells treated with VP22-Cre and
ΔVP22-Cre fusion proteins. M5Pax8 fibroblasts were treated for 18 h with either
Bicine buffer (A), 0.5 µg/ml VP22-Cre (B) or 3.75 µg/ml ΔVP22-Cre (C) in
serum-free medium. Four days after treatment, cells were fixed and
stained with X-Gal.

Fig. 10: Measurement of β-galactosidase activity in cell lysates. M5Pax8
fibroblasts were treated for 18 h with VP22-Cre, ΔVP22-Cre or Bicine
buffer alone, as indicated, or transiently transfected with expression
vectors for Cre (pCMV-I-Cre-pA, see SEQ ID NO: 29) or β-galactosidase
(pCMV-I-β-pA, see SEQ ID NO:30). Four days after treatment, cells were
lysed and the β-galactosidase activities were determined.

Fig. 11: PCR detection of Cre mediated recombination in cells treated with
VP22-Cre and ΔVP22-Cre fusion proteins (shown in SEQ ID NOs: 21 and
14, respectively).

A: PCR-analysis of genomic DNA isolated from M5Pax8 fibroblasts. Cells
were transiently transfected with a Cre expression vector (lane 2) or
treated for 18 h with either buffer alone (lane 3), 7.5 µg/ml VP22-Cre (4,
5) or 15 µg/ml ΔVP22-Cre (6, 7) in serum-free medium. Four days after
treatment, genomic DNA was extracted and subjected to PCR
amplification. Deletion of the loxP-flanked DNA segment is indicated by the
presence of the 226 bp DNA fragment. To confirm the presence of the Nco
I restriction site in the recombined allele, the PCR products were digested
with Nco I which produces two diagnostic fragments of 85 bp and 141 bp
(lanes 5 and 7). Lane 1: 100 bp ladder (Life Technologies), lane 8: 1 kb
ladder (Life Technologies).

B: PCR strategy to detect Cre-mediated deletion of the loxP-flanked DNA 
segment. Arrows indicate the positions of the primers. 
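
The diagnostic digests described for Fig. 6C and Fig. 11A rest on simple
fragment arithmetic: a linear PCR product cut once yields two fragments
whose lengths add up to the uncut product. The following Python sketch
illustrates this. The function name and the cut position (85 bp from one
end of the 226 bp product) are assumptions chosen for illustration; the
text states only the fragment sizes, not the orientation of the site.

```python
from typing import List


def digest_fragments(amplicon_bp: int, cut_positions: List[int]) -> List[int]:
    """Return fragment lengths (bp) of a linear amplicon cut at the given positions.

    Positions are counted in bp from the left end of the amplicon.
    """
    cuts = sorted(set(cut_positions))
    if any(c <= 0 or c >= amplicon_bp for c in cuts):
        raise ValueError("cut positions must lie strictly inside the amplicon")
    bounds = [0] + cuts + [amplicon_bp]
    return [right - left for left, right in zip(bounds, bounds[1:])]


# Fig. 11A: 226 bp product with a single Nco I site (assumed at position 85).
nco_fragments = digest_fragments(226, [85])
print(nco_fragments)
```

Under these assumptions the two fragment lengths sum to the 226 bp product,
which is the consistency check a reader can apply to the reported sizes.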

Detailed Description of the Invention 

The expression "target sequences" according to the present invention
means all kinds of sequences which may be mutated (viz. deleted,
translocated, integrated and/or inverted) by the action of the
recombinase. The number of RRS in the target sequence depends on the
kind of mutation to be performed by the recombinase. For most of the
mutations (especially for deletions and inversions) two RRS are required
which are flanking the sequence to be mutated (deleted or inverted). For
some kinds of integrations only one RRS may be necessary within the
target sequence.
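
As an illustration of how two flanking RRS determine the outcome, the
following Python sketch models a locus as an ordered list of elements with
an orientation. The function name, the element labels and the list
representation are assumptions made for illustration only. The rule it
encodes is the commonly described behavior of Cre/loxP-type systems: two
sites in the same orientation lead to deletion of the intervening segment,
two sites in opposite orientation lead to its inversion. Integrations and
translocations are not modeled.

```python
from typing import List, Tuple

# (name, orientation): +1 for forward, -1 for reverse
Element = Tuple[str, int]


def recombine(locus: List[Element], site: str = "RRS") -> List[Element]:
    """Apply one recombination event between exactly two recognition sites."""
    idx = [i for i, (name, _) in enumerate(locus) if name == site]
    if len(idx) != 2:
        raise ValueError("this sketch handles exactly two recognition sites")
    i, j = idx
    if locus[i][1] == locus[j][1]:
        # Same orientation: the flanked segment is excised, one site remains.
        return locus[: i + 1] + locus[j + 1 :]
    # Opposite orientation: the flanked segment is inverted in place.
    inverted = [(name, -orient) for name, orient in reversed(locus[i + 1 : j])]
    return locus[: i + 1] + inverted + locus[j:]


flanked = [("exon1", 1), ("RRS", 1), ("exon2", 1), ("RRS", 1), ("exon3", 1)]
print(recombine(flanked))
```

In this toy model the RRS-flanked "exon2" is removed while a single RRS is
left behind in the locus.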

The "living organisms" according to the present invention are multi-cell
organisms and can be vertebrates such as mammals (e.g., rodents such
as mice or rats) or non-mammals (e.g., fish) or can be invertebrates such
as insects or worms, or can be plants (higher plants, algae or fungi). Most
preferred living organisms are mice and fish.

"Cell culture" according to the present invention includes cells isolated from
the above defined living organism and cultured in vitro. These cells can be
transformed (immortalized) or untransformed (directly derived from the
living organism; primary cell culture).

The site-specific DNA recombinase domain within the fusion protein of the 
invention of the present application is preferably selected from a 
recombinase protein derived from Cre, Flp, φC31 recombinase (Thorpe and
Smith, Proc. Natl. Acad. Sci. USA, vol. 95, 5505-5510 (1998)), γδ
resolvase (Schwickardi and Droge, FEBS Letters 471: 147-150 (2000)) and
R recombinase (Araki et al., J. Mol. Biol., 182, 191-203 (1985)). The
preferred recombinases are Cre and mutants thereof (preferably the Cre 
variant of aa 15 to 357 of SEQ ID NO: 2 or aa 325-667 of SEQ ID NO: 6) 
and Flp and variants thereof including Flpe (preferably the Flp variant of 
aa 15 to 437 of SEQ ID NO: 4 or aa 325 to 747 of SEQ ID NO: 8). 




The protein transduction domain according to the present invention
includes, but is not limited to, the PTDs mentioned in the Background
section above. The PTD preferably is derived from the VP22 protein of HSV
or from the TAT protein of HIV. Suitable TAT proteins include, but are not
limited to, proteins comprising (i) the amino acid sequence shown in SEQ
ID NO: 10 and mutants thereof, such as (ii) proteins comprising one of the
amino acid sequences

AGRKKRRQRRR (SEQ ID NO: 22)
YARKARRQARR (SEQ ID NO: 23)
YARAAARQARA (SEQ ID NO: 24)
YARAARRAARR (SEQ ID NO: 25)
YARAARRAARA (SEQ ID NO: 26)
YARRRRRRRRR (SEQ ID NO: 27)
YAAARRRRRRR (SEQ ID NO: 28)

as known from WO 99/29721. Preferred are transduction domains 
consisting of the TAT proteins (i) and (ii) above. 

Suitable VP22 proteins include, but are not limited to, the wild-type VP22 
protein, i.e., a protein comprising amino acids 1 to 302 of SEQ ID NO: 21,
and truncated forms thereof. Truncated VP22 proteins in accordance with 
the present invention can be those lacking 1 to 158 amino acid residues at 
their N-terminal end. The most preferred VP22 protein is the truncated 
VP22 PTD comprising amino acid residues 16 to 157 of SEQ ID NO: 14. 

The fusion of the two domains of the fusion protein can occur at any
possible position, i.e., the protein transduction domain can be fused to the
N- or C-terminus of the site-specific DNA recombinase or can be fused to
active sites within the site-specific DNA recombinase. Preferably the
protein transduction domain is fused to the N-terminus of the site-specific
DNA recombinase domain.

The protein transduction domain can be fused to the site-specific DNA
recombinase either through a direct chemical bond or through a linker 
molecule. Such linker molecule can be any bivalent chemical structure 
capable of linking the two domains. The preferred linker molecule 
according to the present invention is a short peptide, e.g., having 1 to 20, 
preferably 1 to 10, amino acid residues. Specifically preferred short
peptides consist essentially of Gly, Ala and/or Leu.

The fusion protein of the invention of the present application may further 
comprise other functional sequences such as secretion conferring signals, 
nuclear localisation signals and/or signals conferring protein stabilisation. 

In case the fusion protein comprises a protein transduction domain 
derived from the TAT protein of HIV, the DNA sequence coding for said 
fusion protein preferably comprises the sequence 
5' TAC GGC CGC AAG AAG CGC CGC CAA CGC CGC CGC 3'. 

Such a preferred DNA sequence is for instance shown in SEQ ID NO: 11. 
In said sequence the 3' terminal codon ggc codes for the linker Gly. The 
DNA sequence of a suitable recombinase may be directly attached to said 
codon ggc. 
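
The correspondence between the preferred DNA sequence and the TAT
peptide of SEQ ID NO: 10 can be checked by translating the codons with
the standard genetic code. The following Python sketch does this; the
function and variable names are illustrative assumptions and are not part
of the sequence listing. The second call appends the linker codon ggc
mentioned above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence (spaces allowed) into one-letter amino acids."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i : i + 3]] for i in range(0, len(seq), 3))


TAT_PTD_DNA = "TAC GGC CGC AAG AAG CGC CGC CAA CGC CGC CGC"
print(translate(TAT_PTD_DNA))           # TAT PTD
print(translate(TAT_PTD_DNA + " GGC"))  # TAT PTD followed by the Gly linker
```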

The fusion protein can be obtained by the following steps: 

1. Fusion of the recombinase coding region (e.g. encoding Cre: see amino 
acids 15 to 357 of SEQ ID NO: 2) with the sequence conferring protein 
translocation (e.g. the sequence encoding the TAT peptide 
YGRKKRRQRRR, SEQ ID NO: 10) using standard cloning protocols 
(Maniatis et al., Cold Spring Harbor Laboratory, New York (1989)) or 
chemical synthesis.

2. Generation of a construct for the expression of the fusion protein in
prokaryotic or eukaryotic cells, e.g. in E. coli DH5α (Hanahan, J. Mol.
Biol. 166(4): 557-80 (1983)) using the QIAexpress pQE vector (Qiagen,
Hilden).

3. Expression of the above mentioned fusion protein in prokaryotic or
eukaryotic cells, e.g. in E. coli DH5α (Hanahan, 1983).

4. Extraction and purification of the above mentioned fusion protein e.g. 
as described in Nagahara et al., Nat. Med., 4(12): 1449-52 (1998). 

In an experiment it was shown that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
both in cell lines and living organisms. In this experiment a vector for the 
expression of a TAT-Cre fusion protein in E. coli was constructed, TAT-Cre 
protein was expressed in E. coli and purified from bacterial lysates. To test 
the activity of the TAT-Cre protein in vitro, a reporter cell line that 
contains a loxP-containing reporter construct was used. This reporter, 
when recombined by Cre recombinase, allows the expression of a
β-galactosidase gene. Further, a transgenic mouse strain carrying a
loxP-flanked target was used to investigate the activity of the TAT-Cre
protein in vivo.

In a second experiment it was shown that VP22-mediated delivery of
active Cre protein works with sufficient efficacy to facilitate inducible gene
targeting. In this experiment bacterial expression vectors were
constructed for the production of VP22-Cre fusion proteins in E. coli. The
activity of the purified VP22-Cre proteins was tested using a reporter
fibroblast cell line containing a loxP-flanked reporter construct.

Thus, gene alteration is induced by injecting the purified fusion protein of
the present invention into a living organism (e.g., a mouse) carrying a gene
comprising the RRS-flanked target sequence (e.g., in an amount of 1 to 200,
preferably 5 to 50 µg per g body weight). To demonstrate the feasibility of
the invention, a reporter mouse strain carrying an RRS-flanked cassette was
used (Thorey et al., Mol. Cell Biol., 18(10):6164 (1998)).
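
The per-weight amounts given above translate into absolute amounts per
animal by simple multiplication. The following Python sketch shows the
arithmetic. The function name and the 25 g body weight are assumptions
chosen for illustration and are not values taken from the experiments.

```python
from typing import Tuple


def dose_range_ug(
    body_weight_g: float,
    low_ug_per_g: float = 5.0,
    high_ug_per_g: float = 50.0,
) -> Tuple[float, float]:
    """Return (low, high) amount of fusion protein in µg for one animal.

    Defaults are the preferred range of 5 to 50 µg per g body weight;
    pass 1.0 and 200.0 for the broader range.
    """
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    return (low_ug_per_g * body_weight_g, high_ug_per_g * body_weight_g)


# Hypothetical 25 g mouse.
print(dose_range_ug(25.0))              # preferred range
print(dose_range_ug(25.0, 1.0, 200.0))  # broader range
```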

Analysis is achieved by determining the pattern of induced target gene
recombination (e.g. through PCR analysis, Southern blot analysis or X-Gal 
staining on tissue sections; Maniatis et al., 1989; Gossler and Zachgo, 
Joyner AL (Ed.), Oxford University Press, Oxford, New York (1993)). 

The procedure's advantages over current technology are as follows: 

(i) The absence of background recombination before administration of 
the fusion protein. 

(ii) The reduction of time and resources which are necessary to combine 
the recombinase transgene and two copies of the RRS-flanked target 
gene by conventional breeding. 

Experiments showed the following: (a) With a suitable vector for
the expression of a TAT-Cre fusion protein, a TAT-Cre fusion protein was
expressed in E. coli and purified from bacterial lysates.

(b) A reporter cell line containing a loxP-containing reporter construct was
used to test the activity of the TAT-Cre protein in vitro. This reporter,
when recombined by Cre recombinase, allows the expression of a
β-galactosidase gene.

(c) A transgenic mouse strain carrying a loxP-flanked target was used to
investigate the activity of the TAT-Cre protein in vivo.

These experiments demonstrate that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
both in cell lines and living organisms.

Furthermore, bacterial expression vectors were constructed for the
production of VP22-Cre fusion proteins in E. coli. The activity of the purified
VP22-Cre proteins was tested using a reporter fibroblast cell line
containing a loxP-flanked reporter construct. These experiments
demonstrate that VP22-mediated delivery of active Cre protein works with
sufficient efficacy to facilitate inducible gene targeting.

The invention is further illustrated by the following, non-limitative 
examples. 

Examples 

Materials and Methods 

Construction of pT7-TACS: The TAT-Cre coding region was generated by
PCR using Advantage-HF PCR Kit (Clontech), 20 pmol of the primers
TATcre sense (5'-atg cca tgg gct acg gcc gca aga agc gcc gcc aac gcc gcc
gcg gca tgt cca att tac tga ccg tac acc-3'; SEQ ID NO:31) and TATcre
antisense (5'-ttt cgg atc cgc cgc ata acc agt g-3'; SEQ ID NO:32) and 10
ng pCMV-I-Cre-pA (see SEQ ID NO: 29) as template. The PCR reaction was
performed using the following cycle profile: 2' 94 °C, 4 x (30" 94 °C,
30" 50 °C, 1' 72 °C), 12 x (30" 94 °C, 30" 55 °C, 1' 72 °C) and 10'
72 °C. The resulting PCR fragment was digested with Nco I and BamH I,
treated with Klenow enzyme and ligated into the plasmid pBSII KS+ which
had been opened with restriction enzyme BamH I, treated with Klenow
and dephosphorylated with calf intestinal phosphatase. The resulting
plasmid pBS TAT-5'cre was verified by DNA sequencing. The plasmid
pCMV-I-Cre-pA (SEQ ID NO: 29) was digested with Age I and Sal I which
released a 1,036 kb fragment containing the 3' part of the Cre coding
region. This fragment was ligated into the plasmid pBS TAT-5'cre which
had been opened with Age I and Sal I.

10 ng pBS-TATCre was subjected to PCR amplification using 20 pmol of
primers FPA001 (5'-tat atc tag acc atg ggc tac ggc cgc aag aag c-3'; SEQ
ID NO: 33) and FPA002 (5'-gct acc acg acc ttc gat acc atc gcc atc ttc cag
cag gcg c-3'; SEQ ID NO: 34). PCR was performed using 2,5 U Platinum
Pfx DNA polymerase (Gibco BRL) and 2 x Enhancer Solution (Gibco BRL)
according to the manufacturer's protocol. The following cycle profile was
used: 2' 94 °C, 25 x (30" 94 °C, 15" 54,6 °C, 2'30" 68 °C). The
amplified PCR fragment was purified using GFX columns (Amersham
Pharmacia), digested with Xba I and ligated into the plasmid pASK75
(Skerra and Arne, Gene 151: 131-135 (1994)) which had been opened
with restriction enzymes Xba I and Eco 47 III and dephosphorylated with
calf intestinal phosphatase. The resulting plasmid pASK75-TACS was
digested with restriction enzymes Nco I and Hind III which released a 1,1
kb fragment. The fragment was subsequently ligated into the plasmid pT7-7
(Studier and Moffatt, J. Mol. Biol. 189: 113-130 (1986)) which had been
opened with restriction enzymes Nco I and Hind III and dephosphorylated
with calf intestinal phosphatase, resulting in the plasmid pT7-TACS (SEQ ID
NO: 16).
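
The cloning steps above rely on restriction sites carried in the 5' ends of
the primers. The following Python sketch checks which of the relevant
recognition sequences occur in each primer. The recognition sequences for
Nco I (CCATGG), BamH I (GGATCC) and Xba I (TCTAGA) are the standard
ones; the dictionary layout and function name are assumptions for
illustration, and the primer strings are transcribed here as plain a/c/g/t
sequences. Because all three sites are palindromic, scanning one strand is
sufficient.

```python
from typing import Dict, List

# Recognition sequences of the enzymes used in the pT7-TACS cloning steps.
SITES: Dict[str, str] = {
    "NcoI": "CCATGG",
    "BamHI": "GGATCC",
    "XbaI": "TCTAGA",
}

PRIMERS: Dict[str, str] = {
    "TATcre sense": (
        "atg cca tgg gct acg gcc gca aga agc gcc gcc aac gcc gcc "
        "gcg gca tgt cca att tac tga ccg tac acc"
    ),
    "TATcre antisense": "ttt cgg atc cgc cgc ata acc agt g",
    "FPA001": "tat atc tag acc atg ggc tac ggc cgc aag aag c",
}


def sites_in(primer: str) -> List[str]:
    """Return the sorted names of all recognition sites present in the primer."""
    seq = "".join(primer.split()).upper()
    return sorted(name for name, motif in SITES.items() if motif in seq)


for name, primer in PRIMERS.items():
    print(name, sites_in(primer))
```

The result is consistent with the digests described: the first PCR product
is cut with Nco I and BamH I, and the FPA001 product with Xba I.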

Construction of pT7-VPCS: The Cre coding region was generated by PCR
using Advantage-HF PCR Kit (Clontech), 20 pmol of the primers VP22cre
sense (5'-taa cta gcg gcc gca tgt cca att tac tga ccg tac ac-3'; SEQ ID
NO: 35) and VP22cre antisense (5'-tcg agc ggc cgc cat cgc cat ctt cca gca
ggc g-3'; SEQ ID NO:36) and 10 ng pgkcre-pA (SEQ ID NO:40) as
template. The PCR reaction was performed using the following cycle
profile: 2' 94 °C, 5 x (30" 94 °C, 30" 50 °C, 2' 72 °C), 15 x (30" 94 °C,
30" 55 °C, 2' 72 °C) and 10' 72 °C. The resulting PCR fragment was
digested with Not I and ligated into the plasmid pVP22/Myc-His
(Invitrogen), which had been opened with restriction enzyme Not I and
dephosphorylated with calf intestinal phosphatase. The resulting plasmid
pVP22-cre myc/His was verified by DNA sequencing.

10 ng pVP22-cre myc/His was subjected to PCR amplification using 20
pmol of primers FPA004 (5'-tat atc tag aca tat gac ctc tcg ccg ctc cg-3';
SEQ ID NO:37) and FPA002 (SEQ ID NO:34). PCR was performed using
2,5 U Platinum Pfx DNA polymerase (Gibco BRL) and 2 x Enhancer
Solution (Gibco BRL) according to the manufacturer's protocol. The
following cycle profile was used: 2' 94 °C, 25 x (30" 94 °C, 15" 54,6
°C, 2'30" 68 °C). The amplified PCR fragment was purified using GFX
columns (Amersham Pharmacia), digested with Xba I and ligated into the
plasmid pASK75 (Skerra and Arne, Gene 151: 131-135 (1994)) which had
been opened with restriction enzymes Xba I and Eco 47 III and
dephosphorylated with calf intestinal phosphatase. The resulting plasmid
pASK75-VPCS was digested with restriction enzymes Nde I and Hind III
which released a 2,0 kb fragment. The fragment was subsequently ligated
into the plasmid pT7-7 (Studier and Moffatt, J. Mol. Biol. 189: 113-130
(1986)) which had been opened with restriction enzymes Nde I and Hind
III and dephosphorylated with calf intestinal phosphatase, resulting in the
plasmid pT7-VPCS (SEQ ID NO: 17).

Construction of pCRT7-ΔVPCS: The ΔVP22-Cre coding region was 
generated by PCR using Platinum Pfx DNA polymerase (Life Technologies), 
20 pmol of the primers FPA007 (5'-ttc cga aga cga cga aac acc-3'; SEQ ID 
NO:38) and FPA008 (5'-tat att cga agc tta tta acc acc gaa ctg cg-3'; SEQ 
ID NO: 39) and 30 ng pT7-VPCS (SEQ ID NO: 17) as template. The PCR 
reaction was performed using the following cycle profile: 2' 94 °C, 25 x 
(30" 94 °C, 30" 61 °C, 2'30" 68 °C) and 7' 68 °C. The resulting 1,8 kb 
PCR fragment was digested with Nco I and Sfu I and ligated into the 
plasmid pCRT7/VP22-1 (Invitrogen), which had been opened with the 
restriction enzymes Nco I and Sfu I and dephosphorylated with calf 
intestinal phosphatase. The resulting plasmid pCRT7-ΔVPCS (SEQ ID 
NO: 15) was verified by DNA sequencing. 

Expression of the fusion proteins in E. coli: E. coli BL21(DE3)-RIL cells 
(Stratagene) were transformed with pT7-TACS and grown on LB agar 
plates containing 100 µg/ml ampicillin. E. coli BL21(DE3)-RP cells 
(Stratagene) were transformed with pT7-VPCS and grown on LB agar 
plates containing 100 µg/ml ampicillin. E. coli BL21(DE3)-pLysS 
(Invitrogen) were transformed with pCRT7-ΔVPCS and grown on LB agar 
plates containing 25 µg/ml kanamycin and 34 µg/ml chloramphenicol. 
Single colonies were isolated and used to prepare glycerol stocks. Eight 
5 ml LB (Luria-Bertani) aliquots containing antibiotics were inoculated 
with stabs from the glycerol stocks and grown overnight at 37°C with 
shaking. Each of four 1 L LB aliquots containing antibiotics was 
inoculated with two 5 ml overnight cultures and grown at 37°C with 
shaking. Growth was monitored by spectrophotometry at 578 nm. When the 
cultures had reached an OD578 of 0,5, expression of the fusion proteins 
was induced by the addition of 0,5 mM isopropyl-β-D-1-thiogalactopyranoside 
(IPTG). Two hours after induction the cells were harvested by 
centrifugation at 12000 x g, and the pellet was rapidly frozen in liquid 
nitrogen and stored immediately at -80°C. 

Purification of the fusion proteins from bacterial lysates: Each 10 g cell 
pellet was resuspended on ice in 30ml Bicine buffer (50mM Bicine, pH 8,5) 
including one protease inhibitor tablet (Complete, Roche). Cells were lysed 
through threefold treatment (1500psi, 5 minutes) with the cell disruption 
bomb (Parr Instrument). 30ml of Benzonase (10000U, Merck) was added 
and cell extracts were incubated for 30 minutes at 4°C. Cell extracts were 
then centrifuged at 12000 x g (4°C). The pellet was redissolved in 8M 
urea, 50mM Bicine, 100mM DTT, pH 8,5 by incubation for 16 hours at 
4°C. The protein extract was centrifuged at 31000 x g and the supernatant 
was harvested. The protein extract was diluted in an equal volume of 
chromatography buffer A (50mM Bicine, pH 8,5). The pH was adjusted to 
8,5 and the extract was filtered through a 0,45 µm filter (Millipore). FPLC 
(Akta Explorer, Amersham Pharmacia) was performed using a cation 
exchange column (Sepharose SP, column body HR 5/5 (0.5 x 5cm), 
column volume (CV) 1ml, linear flow 300cm/hour, Amersham Pharmacia). 
After the sample had been loaded onto the FPLC column, the column was 
washed with 10 CV of chromatography buffer A. 

TAT-Cre and VP22-Cre fusion proteins were eluted from the column by 
gradient elution using chromatography buffer B (50mM Bicine, 1M NaCl, 
pH 8,5) using the following profile: 0 - 50 % buffer B, 0 CV; 50 % buffer 
B, 10 CV; 50 - 100 % buffer B (linear gradient), 20 CV; 100 % buffer B, 
10 CV. ΔVP22-Cre protein was eluted from the column by gradient elution 
using the following profile: 0 - 10 % buffer B, 0 CV; 10 % buffer B, 10 CV; 
10 - 30 % buffer B, 0 CV; 30 % buffer B, 10 CV; 30 - 100 % buffer B, 0 
CV; 100 % buffer B, 10 CV. Three 1,5ml fractions each containing purified 
fusion proteins were collected. Purity and concentration of the protein 
fractions were determined by Coomassie blue stained SDS-PAGE gels and 
Western blot analysis using dilutions of BSA standard solutions. In 
addition, protein content was determined using a Bradford assay 
(Coomassie Plus protein assay, Pierce). 
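
The elution profiles above can be written down as a list of segments and evaluated at any point of the run. The following is a minimal sketch, not part of the described method: the tuple layout (start % B, end % B, length in CV) and the function names are assumptions made for illustration, and the NaCl estimate assumes that buffer A contains no NaCl and buffer B contains 1 M NaCl, as stated in the buffer recipes.

```python
# Each segment: (start % B, end % B, length in column volumes).
# A length of 0 CV is treated as an instantaneous step.
TAT_VP22_PROFILE = [(0, 50, 0), (50, 50, 10), (50, 100, 20), (100, 100, 10)]
DELTA_VP22_PROFILE = [
    (0, 10, 0), (10, 10, 10),
    (10, 30, 0), (30, 30, 10),
    (30, 100, 0), (100, 100, 10),
]


def percent_b(profile, cv):
    """Return the programmed % buffer B after `cv` column volumes of elution."""
    elapsed = 0.0
    current = profile[0][0]
    for start, end, length in profile:
        if length == 0:
            current = end  # step change, consumes no volume
            continue
        if cv <= elapsed + length:
            fraction = (cv - elapsed) / length
            return start + fraction * (end - start)
        elapsed += length
        current = end
    return current  # past the end of the programme: hold the last value


def nacl_molar(percent):
    """NaCl concentration of the A/B mixture (buffer B = 1 M NaCl, buffer A = none)."""
    return percent / 100.0 * 1.0


print(percent_b(TAT_VP22_PROFILE, 20))              # midpoint of the linear gradient
print(nacl_molar(percent_b(TAT_VP22_PROFILE, 20)))  # corresponding NaCl molarity
print(percent_b(DELTA_VP22_PROFILE, 15))            # second plateau of the step profile
```

Writing the two profiles side by side makes the difference explicit: the TAT-Cre/VP22-Cre profile contains one linear gradient, whereas the ΔVP22-Cre profile consists of steps only.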

SDS-PAGE and Western blot analysis: SDS-PAGE and Coomassie staining 
were performed according to standard protocols (Maniatis et al., Cold 
Spring Harbor Laboratory, New York (1989)) using 4-12 % gradient 
SDS-polyacrylamide gels (NuPAGE, Invitrogen, cat. no.: NP0321). 
Western blot analysis was performed using a Semi-Dry Blotting Chamber 
(Biorad) and nitrocellulose membranes (0,2 µm; Schleicher & Schuell) 
according to the manufacturers' protocols. The fusion proteins were 
detected using an alkaline phosphatase-conjugated anti-strep tag 
antibody (IBA, Cat. No.: 2-1503-001) according to the manufacturer's 
protocol. 

Generation of the M5Pax8 Cre reporter cell line: The SV40-transformed 
murine embryonic fibroblast line MEF5/5 (Schwenk et al., Nucl. Acids Res. 
26(6), 1427-32 (1998)) was transfected with the vector pPGKpaX1 
(Kellendonk et al., Nucl. Acids Res. 24, 1404-11 (1996)). 10^6 MEF5/5 cells 
were electroporated with 20 µg pPGKpaX1 plasmid DNA linearised with 
Sca I and plated into 48-well plates. The cells were cultured in 
DMEM/Glutamax medium (Life Technologies) supplemented with 10 % 
fetal calf serum at 37°C, 10 % CO2 in a humid atmosphere. Two days after 
transfection the medium was supplemented with 5 µg/ml puromycin 
(Calbiochem) for the selection of stable integrants. 14 puromycin- 
resistant clones were expanded and tested by transient transfection with 
the Cre expression vector pPGK-Cre-pA (SEQ ID NO: 40). In two out of 
the 14 puromycin-resistant clones, the expression of β-galactosidase 
could be detected by staining with X-Gal. One of these clones, M5Pax8, 
was used as Cre reporter cell line. 

Transfection and measurement of β-galactosidase activity: Fibroblasts 
(10^6 cells per 24-well plate (Falcon)) were transfected with 25 ng pCMV-I- 
Cre-pA (see SEQ ID NO:29) or pCMV-I-β-pA (see SEQ ID NO:30) plasmids 
using the FuGene transfection reagent (Roche Diagnostics). After 2 days 
the cells were lysed and the β-galactosidase activities were determined 
with the β-galactosidase reporter gene assay (Roche Diagnostics) 
according to the manufacturer's guidelines using a Lumistar luminometer 
(MWG). 

Histochemical detection of β-galactosidase activity: To quantitate β- 
galactosidase expression, fibroblast cells were washed once with 
phosphate buffered saline (PBS) and fixed for 5 minutes at 
room temperature in a solution of 4% formaldehyde in PBS. Next, the cells 
were washed twice with PBS and finally incubated in staining solution for 
24 hours at 37°C (staining solution: 5 mM K3(Fe(CN)6), 5 mM 
K4(Fe(CN)6), 2 mM MgCl2, 1 mg/ml X-Gal (BioMol) in PBS). Blue stained, 
β-galactosidase positive cells were detected and distinguished from 
negative (transparent) cells under a cell culture binocular microscope at 
200x magnification. For each determination a minimum of 200 cells was 
counted. 
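
The counting rule above (blue versus transparent cells, at least 200 cells per determination) amounts to a simple percentage. A minimal sketch follows; the function name and the example counts are hypothetical and serve only to illustrate the arithmetic.

```python
def percent_positive(blue, transparent, minimum_total=200):
    """Percentage of X-Gal positive (blue) cells among all counted cells.

    Raises ValueError if fewer than `minimum_total` cells were counted,
    mirroring the minimum of 200 cells per determination stated in the text.
    """
    total = blue + transparent
    if total < minimum_total:
        raise ValueError(
            f"only {total} cells counted; at least {minimum_total} are required"
        )
    return 100.0 * blue / total


# Hypothetical counts from one well, for illustration only.
print(percent_positive(blue=112, transparent=98))
```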

PCR detection of Cre-mediated recombination: Genomic DNA extracted 
from tissue samples was subjected to PCR using Taq polymerase (Gibco 
BRL, Cat. No. 10342-020) and 20 pmol of each primer (sense: 5'-CAT 
CTC CGG GCC TTT CGA CCT G-3', antisense: 5'-GCG ATC GGT GCG 
GGC CTC TTC-3'; SEQ ID NOs: 41 and 42, respectively). PCR was 
performed using the following cycle profile: 2' 94°C, 35 x (30" 94°C, 30" 
55 °C, 1' 72 °C), 10 min 72 °C. PCR products were separated on a 1,2 % 
agarose gel. 
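
Basic properties of the two detection primers can be checked with a few lines of code. This is an illustrative sketch only: the helper names are not from the source, and the sequences are the primer sequences quoted above with the spaces removed.

```python
SENSE = "CATCTCCGGGCCTTTCGACCTG"     # SEQ ID NO: 41, as quoted in the text
ANTISENSE = "GCGATCGGTGCGGGCCTCTTC"  # SEQ ID NO: 42, as quoted in the text

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def gc_fraction(seq):
    """Fraction of G and C bases in a primer."""
    seq = seq.upper()
    return sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq):
    """Reverse complement, i.e. the strand the primer anneals to, read 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


for name, primer in (("sense", SENSE), ("antisense", ANTISENSE)):
    print(name, len(primer), "nt, GC fraction", round(gc_fraction(primer), 2))
    print("  anneals to:", reverse_complement(primer))
```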

Example 1 

The vector pT7-TACS (SEQ ID NO: 16) was constructed for the expression 
of a TAT-Cre fusion protein in E. coli. The plasmid contains the coding 
region of the 11 amino acid protein transduction domain of the wild-type 
HIV TAT protein (Green and Loewenstein, Cell, 55(6): 1179-88 (1988); 
Frankel and Pabo, Cell, 55(6): 1189-93 (1988); SEQ ID NO: 10) fused to 
the N-terminus of the Cre recombinase protein sequence. The 10-amino-acid 
strep tag at the C-terminus allows the detection and purification of the 
fusion protein using specific antibodies (Schmidt and Skerra, J. 
Chromatogr A 676: 337-345 (1994)). The protease factor Xa recognition 
site (Ile-Glu-Gly-Arg) permits the removal of the strep tag by proteolytic 
cleavage. The estimated molecular weight of the TAT-Cre fusion protein is 
42 kDa. A scheme of the TAT-Cre expression vector is depicted in figure 2. 
For the expression of TAT-Cre, the E. coli strain BL21(DE3)-RIL 
(Stratagene) was used. This strain carries an IPTG-inducible T7 
polymerase gene and additional copies of the tRNA genes for the 'rare 
codons' argU, ileY and leuW. 
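
To illustrate how the 11 amino acid TAT transduction domain sits at the N-terminus of the fusion, the first 16 codons of the TAT-Cre coding sequence (SEQ ID NO: 1 in the sequence listing) can be translated with the standard genetic code. This is a sketch under stated assumptions: the codon string is copied from the listing, with the last codon read as tcc (Ser) in agreement with the amino acid line, and the helper names are illustrative.

```python
from itertools import product

# Standard genetic code, codons enumerated in t/c/a/g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(spaced_codons):
    """Translate a space-separated codon string into one-letter amino acids."""
    return "".join(CODON_TABLE[codon] for codon in spaced_codons.lower().split())


# First 16 codons of SEQ ID NO: 1 (assumption: last codon read as tcc).
TAT_CRE_START = "atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc"

protein = translate(TAT_CRE_START)
print(protein)                   # MGYGRKKRRQRRRGMS
print("YGRKKRRQRRR" in protein)  # True: the 11-residue TAT domain
```

In this reading the TAT domain (YGRKKRRQRRR) follows the initial Met-Gly and is joined by a single glycine to the Met-Ser that begins the Cre sequence.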

E. coli BL21(DE3)-RIL cells were transformed with pT7-TACS and grown in 
LB medium containing 100 µg/ml ampicillin. The expression of the 40 kDa 
TAT-Cre fusion protein could be strongly induced by the addition of 0,5 
mM IPTG to the culture medium. Analysis of protein lysates revealed that 
approximately 50 % of the TAT-Cre protein accumulated as insoluble 
inclusion bodies. The inclusion bodies were extracted and dissolved in 
8 M urea. TAT-Cre was subsequently purified from this fraction using ion 
exchange chromatography. The quantity and purity of the TAT-Cre protein 
were determined using Coomassie stained SDS-PAGE gels and Western blot 
analysis (figure 3). The purification process yielded TAT-Cre protein 
extracts of 64 % purity and a concentration of 100 µg/ml. 

To analyse the ability of the purified TAT-Cre protein to transduce into 
cultured cells, we used the fibroblast cell line M5Pax8 (R. Kuhn, 
unpublished) that contains a loxP-containing reporter construct. This 
reporter, when recombined by Cre recombinase, allows the expression of 
a β-galactosidase gene (Buchholz et al., Nucleic Acids Res. 24, 4256-4262, 
1996). Cells were cultured for 18 h with increasing concentrations of TAT- 
Cre protein in serum-free medium and analysed 4 days later for β- 
galactosidase activity. Staining with X-Gal showed that > 50 % of the cells 
treated with 13,8 µg/ml TAT-Cre protein expressed β-galactosidase, 
indicating that recombination of the loxP-flanked reporter construct had 
occurred (figure 4). Measurement of β-galactosidase activity in cell 
lysates revealed an up to 30-fold higher level of β-galactosidase activity 
in comparison to cells which had been transiently transfected with a 
eukaryotic Cre expression vector (figure 5). 
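
For comparison with other protein delivery experiments it can be convenient to express the protein dose as a molar concentration. The sketch below converts µg/ml into nM; the function name is illustrative, and the molecular weight of 42 kDa is the estimate given for TAT-Cre in this example, so the result is approximate.

```python
def ug_per_ml_to_nanomolar(ug_per_ml, mw_kda):
    """Convert a protein concentration from µg/ml to nM.

    1 µg/ml equals 1e-3 g/l; dividing by the molar mass (mw_kda * 1000 g/mol)
    gives mol/l, and multiplying by 1e9 gives nM, i.e. 1000 / mw_kda nM per µg/ml.
    """
    return ug_per_ml * 1000.0 / mw_kda


# 13,8 µg/ml TAT-Cre, using the estimated molecular weight of 42 kDa
print(round(ug_per_ml_to_nanomolar(13.8, 42.0)))  # about 329 nM
```

Because the preparation was reported as 64 % pure, the molar concentration of TAT-Cre itself may be lower than this figure if the stated µg/ml refers to total protein.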

To investigate the activity of TAT-Cre protein in a living organism, we used 
a transgenic mouse strain carrying a loxP-flanked target for Cre-mediated 
recombination (Thorey et al., 1998, Mol. Cell. Biol. 18: 3081 - 3088). Mice 
were treated three times with intraperitoneal injections of 75 µg TAT-Cre 
protein at two-day intervals and analysed 2 days later. Genomic DNA was 
isolated from a variety of organs and subjected to PCR amplification which 
specifically amplifies a 400 bp fragment of the recombined allele. The 
deleted allele could be detected in multiple tissues from treated mice, 
indicating TAT-Cre-mediated recombination in these organs (figure 6). 
This experiment demonstrates that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
in cell lines and in living organisms. 

Example 2 

The vectors pT7-VPCS (SEQ ID NO: 17) and pCRT7-ΔVPCS (SEQ ID NO: 15) 
were constructed for the expression of VP22-Cre and ΔVP22-Cre fusion 
proteins in E. coli. The VP22-Cre gene of pT7-VPCS contains the full length 
protein translocation domain of the HSV VP22 protein (Elliott and O'Hare, 
Cell, 88(2): 223-33 (1997)), whereas the ΔVP22-Cre gene of pCRT7-ΔVPCS 
contains a truncated VP22 protein transduction domain (amino acids 159 - 
301; Invitrogen; aa 16-157 of SEQ ID NO:14) fused to the N-terminus of the 
Cre recombinase protein sequence. A 10-amino-acid strep tag at the C- 
terminus of the Cre protein sequence allows the detection and purification 
of the fusion proteins using specific antibodies (Schmidt and Skerra, J. 
Chromatogr A 676: 337-345 (1994)). The protease factor Xa recognition 
site permits the removal of the strep tag by proteolytic cleavage. The 
estimated molecular weight is 75 kDa for VP22-Cre protein and 60 kDa for 
ΔVP22-Cre protein. A scheme of the vectors pT7-VPCS and pCRT7-ΔVPCS 
is depicted in figure 7. 

E. coli BL21(DE3)-RP cells (Stratagene) were transformed with pT7-VPCS 
and cultured in LB medium containing 100 µg/ml ampicillin. E. coli 
BL21(DE3)-pLysS cells (Stratagene) were transformed with pCRT7-ΔVPCS 
and cultured in LB medium containing 25 µg/ml kanamycin and 34 µg/ml 
chloramphenicol. Expression of the VP22-Cre and ΔVP22-Cre fusion 
proteins could be induced by the addition of 0,5 mM IPTG to the culture 
medium. Analysis of protein extracts using Coomassie staining and 
Western blotting of SDS-PAGE gels revealed that 50 - 60 % of the VP22-Cre 
and ΔVP22-Cre proteins accumulated as insoluble inclusion bodies. The 
inclusion bodies were extracted and dissolved in 8 M urea. VP22-Cre and 
ΔVP22-Cre fusion proteins were subsequently purified using ion exchange 
chromatography. The quantity and purity of the isolated VP22-Cre and 
ΔVP22-Cre fusion proteins were determined using Coomassie stained SDS- 
PAGE gels and Western blot analysis (figure 8). 

To analyse the ability of the purified fusion proteins to transduce into 
cultured cells, we used the fibroblast cell line M5Pax8 that contains a loxP- 
containing reporter construct. When recombined by Cre recombinase, the 
reporter allows the expression of a β-galactosidase gene (Buchholz et al., 
Nucleic Acids Res. 24, 4256-4262, 1996). The cells were cultured for 18 
h with increasing concentrations of VP22-Cre and ΔVP22-Cre in serum-free 
medium and analysed 4 days later for β-galactosidase activity. Staining 
with X-Gal showed ~2 % blue cells in the cultures treated with up to 15 
µg/ml ΔVP22-Cre, indicating that recombination of the loxP-flanked 
reporter construct had occurred. In contrast, cell cultures treated with 
up to 0,5 µg/ml VP22-Cre did not show any X-Gal staining (figure 9). 
Measurement of cell lysates revealed a strong increase of β-galactosidase 
activity upon ΔVP22-Cre treatment when compared to untreated cells 
(figure 10). Genomic DNA was isolated and subjected to PCR amplification 
that specifically amplifies a 250 bp fragment of the recombined allele. The 
deleted allele could be detected in cells treated with both VP22-Cre and 
ΔVP22-Cre fusion proteins (figure 11). 

This experiment demonstrates that VP22-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting. 

SEQUENCE LISTING 
<110> ARTEMIS Pharmaceuticals GmbH 

<120> Transduction of recombinases for inducible gene 
targeting 

<130> 010007wo/JH/ml 

<140> 
<141> 

<160> 42 

<170> PatentIn Ver. 2.1 



<210> 1 
<211> 1074 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 

<220> 

<221> CDS 

<222> (1)..(1071) 

<400> 1 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc 48
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96
Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala
20 25 30

acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 144
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg
35 40 45

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tcc gtt tgc cgg 192
Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg
50 55 60

tcg tgg gcg gca tgg tgc aag ttg aat aac cgg aaa tgg ttt ccc gca 240
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala
65 70 75 80

gaa cct gaa gat gtt cgc gat tat ctt cta tat ctt cag gcg cgc ggt 288
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly
85 90 95

ctg gca gta aaa act atc cag caa cat ttg ggc cag cta aac atg ctt 336
Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu
100 105 110

cat cgt cgg tcc ggg ctg cca cga cca agt gac agc aat gct gtt tca 384
His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser
115 120 125

ctg gtt atg cgg cgg atc cga aaa gaa aac gtt gat gcc ggt gaa cgt 432
Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg
130 135 140

gca aaa cag gct cta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 480
Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg
145 150 155 160

tca ctc atg gaa aat agc gat cgc tgc cag gat ata cgt aat ctg gca 528
Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala
165 170 175

ttt ctg ggg att gct tat aac acc ctg tta cgt ata gcc gaa att gcc 576
Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala
180 185 190

agg atc agg gtt aaa gat atc tca cgt act gac ggt ggg aga atg tta 624
Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu
195 200 205

atc cat att ggc aga acg aaa acg ctg gtt agc acc gca ggt gta gag 672
Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu
210 215 220

aag gca ctt agc ctg ggg gta act aaa ctg gtc gag cga tgg att tcc 720
Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser
225 230 235 240

gtc tct ggt gta gct gat gat ccg aat aac tac ctg ttt tgc cgg gtc 768
Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val
245 250 255

aga aaa aat ggt gtt gcc gcg cca tct gcc acc agc cag cta tca act 816
Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr
260 265 270

cgc gcc ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864
Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly
275 280 285

gct aag gat gac tct ggt cag aga tac ctg gcc tgg tct gga cac agt 912
Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser
290 295 300

gcc cgt gtc gga gcc gcg cga gat atg gcc cgc gct gga gtt tca ata 960
Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile
305 310 315 320

ccg gag atc atg caa gct ggt ggc tgg acc aat gta aat att gtc atg 1008
Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met
325 330 335

aac tat atc cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056
Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu
340 345 350

ctg gaa gat ggc gat tag 1074
Leu Glu Asp Gly Asp
355



<210> 2 
<211> 357 
<212> PRT 
<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 

<400> 2 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala
20 25 30

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg
35 40 45

Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg
50 55 60

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala
65 70 75 80

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly
85 90 95

Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu
100 105 110

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser
115 120 125

Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg
130 135 140

Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg
145 150 155 160

Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala
165 170 175

Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala
180 185 190

Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu
195 200 205

Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu
210 215 220

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser
225 230 235 240

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val
245 250 255

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr
260 265 270

Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly
275 280 285

Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser
290 295 300

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile
305 310 315 320

Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met
325 330 335

Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu
340 345 350

Leu Glu Asp Gly Asp
355



<210> 3 
<211> 1317 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

<220> 

<221> CDS 

<222> (1)..(1311) 

<400> 3 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg agt 48
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

caa ttt gat ata tta tgt aaa aca cca cct aag gtc ctg gtt cgt cag 96
Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gln
20 25 30

ttt gtg gaa agg ttt gaa aga cct tca ggg gaa aaa ata gca tca tgt 144
Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys Ile Ala Ser Cys
35 40 45

gct gct gaa cta acc tat tta tgt tgg atg att act cat aac gga aca 192
Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met Ile Thr His Asn Gly Thr
50 55 60

gca atc aag aga gcc aca ttc atg agc tat aat act atc ata agc aat 240
Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr Ile Ile Ser Asn
65 70 75 80

tcg ctg agt ttc gat att gtc aac aaa tca ctc cag ttt aaa tac aag 288
Ser Leu Ser Phe Asp Ile Val Asn Lys Ser Leu Gln Phe Lys Tyr Lys
85 90 95

acg caa aaa gca aca att ctg gaa gcc tca tta aag aaa tta att cct 336
Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser Leu Lys Lys Leu Ile Pro
100 105 110

gct tgg gaa ttt aca att att cct tac aat gga caa aaa cat caa tct 384
Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn Gly Gln Lys His Gln Ser
115 120 125

gat atc act gat att gta agt agt ttg caa tta cag ttc gaa tca tcg 432
Asp Ile Thr Asp Ile Val Ser Ser Leu Gln Leu Gln Phe Glu Ser Ser
130 135 140

gaa gaa gca gat aag gga aat agc cac agt aaa aaa atg ctt aaa gca 480
Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu Lys Ala
145 150 155 160

Su T*n c 9 * rf 9 ag ° atC tgg ga 9 atc act aaa ata eta 

Leu Leu Ser Glu Gly Glu Ser lie Trp Glu He Thr Glu Lys lie Leu 



165 !7o 



175 



£™ I 9 o 2f g tat acc tcg aga ttfc aca aaa aca aaa act tta tac 
Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr Leu Tyr 
180 185 * 



190 



His 

395 400 



2? ^ ag f aag ggt agt gct gaa gg a a 9 c ata c g a ta c c CC gca 

He Glu Gin Leu Lys Gly Ser Ma Glu Gly Ser He Arg Tyr Pro Ala 
405 410 ' 415 



528 



576 



G 3 n So j tC ok° T ta ??* 3Ct ttC atc aat tgt gga aga ttc age gat 624 
Gin Phe Leu Phe Leu Ala Thr Phe He Asn Cys Gly Arg Phe Ser Asp 
195 200 205 

ill III a 30 S*? l at t Cg aaa tCa ttfc aaa tta g fcc °aa aat aag tat 672 
He Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn Lys Tyr 
210 215 220 

Leu Glv vif I?! t\ C rf 9 T " a 9tg 3Ca gag 3Ca aa g aca a 9c gtt 720 
Leu Gly Val He He Gin Cys Leu Val Thr Glu Thr Lys Thr Ser Val 

225 23 0 235 240 

lit III « ac a ^ a t ao «u c t ^ t agc gca ag g ggt a gg atc g at cca <=tt 768 

Ser Arg His lie Tyr Phe Phe Ser Ala Arg Gly Arg He Asp Pro Leu 
245 250 " 255 

v= a ^ r tg gat 9aa ttfc ttg agg aat tct gaa cca gtc eta aaa cga 816 
Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu Lys Arg 
260 265 270 

399 a u° 2?° aat tCt tca agc aac aaa ca 5 gaa tac caa tta 864 
Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr Gin Leu 
275 280 285 

tta aaa gat aac tta gtc aga tcg tac aac aag gct ttg aag aaa aat 912 
Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys Lys Asn 
290 295 300 

a?5 p ct l at n ca a ^ c gct ata aag aat gg° cca aaa tc t c ac «tt geo 

Ala Pro Tyr Pro He Phe Ala He Lys Asn Gly Pro Lys Ser His He 
J05 3 10 315 320 

nl a ™ S at ^ tg £ tg a ?° tca ttfc ctg tca at g aa g ggc eta acg gag ioob 

Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu Thr Glu 
325 330 335 

ttg act aat gtt gtg gga aat tgg age gat aag cgt gct tct gec gtg 1056 
Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val 
340 345 " 350 

IT 399 a u 3 a f g tat act cat cag ata aca g ca ata cct gat cac tac 1104 
Ala Arg Thr Thr Tyr Thr His Gin He Thr Ala He Pro Asp His Tyr 
355 360 365 

111 ?? a r tH ?^ l Ct C9g tac tat gca tat g at cca ata tca aag gaa 1152 
Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro He Ser Lys Glu 
370 375 380 

£? f^ a g f a r tg f ag gat gag act aat cca att g fl g g g g fc gg cag cat 1200 

Met He Ala Leu Lys Asp Glu Thr Asn Pro He Glu Glu Tro Gin 
J85 390 



1248

tgg aat ggg ata ata tca cag gag gta cta gac tac ctt tca tcc tac 1296
Trp Asn Gly Ile Ile Ser Gln Glu Val Leu Asp Tyr Leu Ser Ser Tyr
420 425 430

ata aat aga cgc ata taatga 1317
Ile Asn Arg Arg Ile
435



<210> 4 
<211> 437 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

<400> 4 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gln
20 25 30

Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys Ile Ala Ser Cys
35 40 45

Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met Ile Thr His Asn Gly Thr
50 55 60

Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr Ile Ile Ser Asn
65 70 75 80

Ser Leu Ser Phe Asp Ile Val Asn Lys Ser Leu Gln Phe Lys Tyr Lys
85 90 95

Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser Leu Lys Lys Leu Ile Pro
100 105 110

Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn Gly Gln Lys His Gln Ser
115 120 125

Asp Ile Thr Asp Ile Val Ser Ser Leu Gln Leu Gln Phe Glu Ser Ser
130 135 140

Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu Lys Ala
145 150 155 160

Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu Ile Thr Glu Lys Ile Leu
165 170 175

Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr Leu Tyr
180 185 190

Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn Cys Gly Arg Phe Ser Asp
195 200 205

Ile Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gln Asn Lys Tyr
210 215 220

Leu Gly Val Ile Ile Gln Cys Leu Val Thr Glu Thr Lys Thr Ser Val
225 230 235 240

Ser Arg His Ile Tyr Phe Phe Ser Ala Arg Gly Arg Ile Asp Pro Leu
245 250 255

Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu Lys Arg
260 265 270

Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gln Glu Tyr Gln Leu
275 280 285

Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys Lys Asn
290 295 300

Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn Gly Pro Lys Ser His Ile
305 310 315 320

Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu Thr Glu
325 330 335

Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val
340 345 350

Ala Arg Thr Thr Tyr Thr His Gln Ile Thr Ala Ile Pro Asp His Tyr
355 360 365

Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro Ile Ser Lys Glu
370 375 380

Met Ile Ala Leu Lys Asp Glu Thr Asn Pro Ile Glu Glu Trp Gln His
385 390 395 400

Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly Ser Ile Arg Tyr Pro Ala
405 410 415

Trp Asn Gly Ile Ile Ser Gln Glu Val Leu Asp Tyr Leu Ser Ser Tyr
420 425 430

Ile Asn Arg Arg Ile
435



<210> 5 
<211> 2004 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 

<220> 

<221> CDS 

<222> (1)..(2001) 

<400> 5 

atg acc tct cgc cgc tcc gtg aag tcg ggt ccg cgg gag gtt ccg cgc 48
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

gat gag tac gag gat ctg tac tac acc ccg tct tca ggt atg gcg agt 96
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

ccc gat agt ccg cct gac acc tcc cgc cgt ggc gcc cta cag aca cgc 144
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

tcg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag tcg gat 192
Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

9 ?° ? tC ™ aC 999 ggC tcg tct tcc 9 aa 9 ac gaa cac ccg gag 240 
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Lp Glu His Pro Glu 

70 



205 



220 



288 



75 80 

p™ a" ^k 9 *" Cgt CCC gtt tcc 9 C 9 9t* fc tg tcc ggc ccg 
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 go 9 | 

ggg cct gcg egg gcg cct ccg cca ccc get ggg tcc gga ggg gec gqa 336 
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly 8? Ma Gly 

100 105 no 

a™ tk 3 o C ° 5£ C 9CC CCC cgg gcc ccc c 9 a acc cag egg gtg gcg 
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 

115 120 12 5 

Iht ?v? ??° d CC ?? 9 ?f C CCg gC9 gcg gag acc acc C 9 C 99° a 99 aaa 432 
Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 

li0 135 140 



384 



480 



528 



tcg gcc cag cca gaa tcc gcc gca etc cca gac gcc ccc gcg tcg acg 
Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro 111 Ser Thr 
145 150 155 16Q 

?? 9 t° a C Cga tcc aa 9 aca ccc cag ggg ctg gcc aga aag ctg 
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg LyI Leu 
165 170 i 7 5 

h 3 « Si* e 9C a v° 9 f° CCC cca aac ccc 9 ao gcg cca tgg acc ccc egg 576 
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

H fc ? ?f C 2 g ° fctt aaC aa9 cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624 
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala 111 Val Gly Arg Leu 
195 200 



ITU a?= 5 t9 S 3t g ?° atg gcg gcg 9tc cag etc tgg gac atg tcg 672 
Ala Ala Met Hrs Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 



a™ b 90 ^ ga ° ^ a ga ° Ctc aac gaa etc ctt ggc ate acc acc 720 
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
225 230 235 



240 

ate cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 7 68 
lie Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 

111 it» £? » at S° a ga ° gtg 9tg cag gac gtc gac gcg gcc acg gcg 
Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 

?hr a™ r? 9 £ gt c Ct 9 f 9 gcg tcg cgc ccc acc gag cga cct cga gcc 
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 



.J 

816 
864 
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cca gcc cgc tec get tct cgc ccc aga egg ccc gtc gag ggt ace gag 
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 



912 



etc gga tec act agt cca gtg tgg tgg aat tct gca gat ate cag cac 960 
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp lie Gin His 
305 310 315 320 

agt ggc ggc cgc atg tcc aat tta ctg acc gta cac caa aat ttg cct 1008
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
325 330 335

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
340 345 350

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
355 360 365

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 1152
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
370 375 380

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 1200
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
385 390 395 400

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 1248
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
405 410 415

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 1296
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
420 425 430

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 1344
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
435 440 445

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 1392
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
450 455 460

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1440
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
465 470 475 480

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1488
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
485 490 495

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1536
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
500 505 510

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1584
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
515 520 525

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1632
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
530 535 540

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1680
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
545 550 555 560

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1728
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
565 570 575

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1776
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
580 585 590

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1824
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
595 600 605

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1872
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
610 615 620

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1920
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
625 630 635 640

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1968
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
645 650 655

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat tag 2004
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp
660 665

<210> 6 
<211> 667 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 

<400> 6 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
325 330 335

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
340 345 350

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
355 360 365

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
370 375 380

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
385 390 395 400

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
405 410 415

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
420 425 430

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
435 440 445

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
450 455 460

Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
465 470 475 480

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
485 490 495

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
500 505 510

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
515 520 525

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
530 535 540

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
545 550 555 560

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
565 570 575

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
580 585 590

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
595 600 605

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
610 615 620

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
625 630 635 640

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
645 650 655

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp
660 665



<210> 7 
<211> 2247 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 

<220> 

<221> CDS 

<222> (1)..(2241) 

<400> 7 

atg acc tct cgc cgc tcc gtg aag tcg ggt ccg cgg gag gtt ccg cgc 48
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

gat gag tac gag gat ctg tac tac acc ccg tct tca ggt atg gcg agt 96
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

ccc gat agt ccg cct gac acc tcc cgc cgt ggc gcc cta cag aca cgc 144
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

tcg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag tcg gat 192
Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

tat gcc ctc tac ggg ggc tcg tct tcc gaa gac gac gaa cac ccg gag 240
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

gtc ccc cgg acg cgg cgt ccc gtt tcc ggg gcg gtt ttg tcc ggc ccg 288
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

ggg cct gcg cgg gcg cct ccg cca ccc gct ggg tcc gga ggg gcc gga 336
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

cgc aca ccc acc acc gcc ccc cgg gcc ccc cga acc cag cgg gtg gcg 384
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

act aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432
Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

tcg gcc cag cca gaa tcc gcc gca ctc cca gac gcc ccc gcg tcg acg 480
Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 576
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

gcg gcc atg cat gcc cgg atg gcg gcg gtc cag ctc tgg gac atg tcg 672
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 720
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 768
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 864
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 912
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 960
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

agt ggc ggc cgc atg agt caa ttt gat ata tta tgt aaa aca cca cct 1008
Ser Gly Gly Arg Met Ser Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro
325 330 335

aag gtc ctg gtt cgt cag ttt gtg gaa agg ttt gaa aga cct tca ggg 1056
Lys Val Leu Val Arg Gln Phe Val Glu Arg Phe Glu Arg Pro Ser Gly
340 345 350

gaa aaa ata gca tca tgt gct gct gaa cta acc tat tta tgt tgg atg 1104
Glu Lys Ile Ala Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met
355 360 365

att act cat aac gga aca gca atc aag aga gcc aca ttc atg agc tat 1152
Ile Thr His Asn Gly Thr Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr
370 375 380

aat act atc ata agc aat tcg ctg agt ttc gat att gtc aac aaa tca 1200
Asn Thr Ile Ile Ser Asn Ser Leu Ser Phe Asp Ile Val Asn Lys Ser
385 390 395 400

ctc cag ttt aaa tac aag acg caa aaa gca aca att ctg gaa gcc tca 1248
Leu Gln Phe Lys Tyr Lys Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser
405 410 415

tta aag aaa tta att cct gct tgg gaa ttt aca att att cct tac aat 1296
Leu Lys Lys Leu Ile Pro Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn
420 425 430

gga caa aaa cat caa tct gat atc act gat att gta agt agt ttg caa 1344
Gly Gln Lys His Gln Ser Asp Ile Thr Asp Ile Val Ser Ser Leu Gln
435 440 445

tta cag ttc gaa tca tcg gaa gaa gca gat aag gga aat agc cac agt 1392
Leu Gln Phe Glu Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser
450 455 460

aaa aaa atg ctt aaa gca ctt cta agt gag ggt gaa agc atc tgg gag 1440
Lys Lys Met Leu Lys Ala Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu
465 470 475 480

atc act gag aaa ata cta aat tcg ttt gag tat acc tcg aga ttt aca 1488
Ile Thr Glu Lys Ile Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr
485 490 495

aaa aca aaa act tta tac caa ttc ctc ttc cta gct act ttc atc aat 1536
Lys Thr Lys Thr Leu Tyr Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn
500 505 510

tgt gga aga ttc agc gat att aag aac gtt gat ccg aaa tca ttt aaa 1584
Cys Gly Arg Phe Ser Asp Ile Lys Asn Val Asp Pro Lys Ser Phe Lys
515 520 525

tta gtc caa aat aag tat ctg gga gta ata atc cag tgt tta gtg aca 1632
Leu Val Gln Asn Lys Tyr Leu Gly Val Ile Ile Gln Cys Leu Val Thr
530 535 540

gag aca aag aca agc gtt agt agg cac ata tac ttc ttt agc gca agg 1680
Glu Thr Lys Thr Ser Val Ser Arg His Ile Tyr Phe Phe Ser Ala Arg
545 550 555 560

ggt agg atc gat cca ctt gta tat ttg gat gaa ttt ttg agg aat tct 1728
Gly Arg Ile Asp Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser
565 570 575

gaa cca gtc cta aaa cga gta aat agg acc ggc aat tct tca agc aac 1776
Glu Pro Val Leu Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn
580 585 590

aaa cag gaa tac caa tta tta aaa gat aac tta gtc aga tcg tac aac 1824
Lys Gln Glu Tyr Gln Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn
595 600 605

aag gct ttg aag aaa aat gcg cct tat cca atc ttt gct ata aag aat 1872
Lys Ala Leu Lys Lys Asn Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn
610 615 620

ggc cca aaa tct cac att gga aga cat ttg atg acc tca ttt ctg tca 1920
Gly Pro Lys Ser His Ile Gly Arg His Leu Met Thr Ser Phe Leu Ser
625 630 635 640

atg aag ggc cta acg gag ttg act aat gtt gtg gga aat tgg agc gat 1968
Met Lys Gly Leu Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp
645 650 655

aag cgt gct tct gcc gtg gcc agg aca acg tat act cat cag ata aca 2016
Lys Arg Ala Ser Ala Val Ala Arg Thr Thr Tyr Thr His Gln Ile Thr
660 665 670

gca ata cct gat cac tac ttc gca cta gtt tct cgg tac tat gca tat 2064
Ala Ile Pro Asp His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr
675 680 685

gat cca ata tca aag gaa atg ata gca ttg aag gat gag act aat cca 2112
Asp Pro Ile Ser Lys Glu Met Ile Ala Leu Lys Asp Glu Thr Asn Pro
690 695 700

att gag gag tgg cag cat ata gaa cag cta aag ggt agt gct gaa gga 2160
Ile Glu Glu Trp Gln His Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly
705 710 715 720

agc ata cga tac ccc gca tgg aat ggg ata ata tca cag gag gta cta 2208
Ser Ile Arg Tyr Pro Ala Trp Asn Gly Ile Ile Ser Gln Glu Val Leu
725 730 735

gac tac ctt tca tcc tac ata aat aga cgc ata taatga 2247
Asp Tyr Leu Ser Ser Tyr Ile Asn Arg Arg Ile
740 745



<210> 8 
<211> 747 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 

<400> 8 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

Ser Gly Gly Arg Met Ser Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro
325 330 335

Lys Val Leu Val Arg Gln Phe Val Glu Arg Phe Glu Arg Pro Ser Gly
340 345 350

Glu Lys Ile Ala Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met
355 360 365

Ile Thr His Asn Gly Thr Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr
370 375 380

Asn Thr Ile Ile Ser Asn Ser Leu Ser Phe Asp Ile Val Asn Lys Ser
385 390 395 400

Leu Gln Phe Lys Tyr Lys Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser
405 410 415

Leu Lys Lys Leu Ile Pro Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn
420 425 430

Gly Gln Lys His Gln Ser Asp Ile Thr Asp Ile Val Ser Ser Leu Gln
435 440 445

Leu Gln Phe Glu Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser
450 455 460

Lys Lys Met Leu Lys Ala Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu
465 470 475 480

Ile Thr Glu Lys Ile Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr
485 490 495

Lys Thr Lys Thr Leu Tyr Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn
500 505 510

Cys Gly Arg Phe Ser Asp Ile Lys Asn Val Asp Pro Lys Ser Phe Lys
515 520 525

Leu Val Gln Asn Lys Tyr Leu Gly Val Ile Ile Gln Cys Leu Val Thr
530 535 540

Glu Thr Lys Thr Ser Val Ser Arg His Ile Tyr Phe Phe Ser Ala Arg
545 550 555 560

Gly Arg Ile Asp Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser
565 570 575

Glu Pro Val Leu Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn
580 585 590

Lys Gln Glu Tyr Gln Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn
595 600 605

Lys Ala Leu Lys Lys Asn Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn
610 615 620

Gly Pro Lys Ser His Ile Gly Arg His Leu Met Thr Ser Phe Leu Ser
625 630 635 640

Met Lys Gly Leu Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp
645 650 655

Lys Arg Ala Ser Ala Val Ala Arg Thr Thr Tyr Thr His Gln Ile Thr
660 665 670

Ala Ile Pro Asp His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr
675 680 685

Asp Pro Ile Ser Lys Glu Met Ile Ala Leu Lys Asp Glu Thr Asn Pro
690 695 700

Ile Glu Glu Trp Gln His Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly
705 710 715 720

Ser Ile Arg Tyr Pro Ala Trp Asn Gly Ile Ile Ser Gln Glu Val Leu
725 730 735

Asp Tyr Leu Ser Ser Tyr Ile Asn Arg Arg Ile
740 745


<210> 9 
<211> 33 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 9 

tacggccgca agaagcgccg ccaacgccgc cgc 33 

<210> 10 
<211> 11 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 10 

Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10 



<210> 11 
<211> 42 
<212> DNA 

<213> Human immunodeficiency virus 

<220> 
<221> CDS 
<222> (4)..(42)

<400> 11 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc 42
Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly
1 5 10



<210> 12 
<211> 13 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 12 

Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly
1 5 10 



<210> 13 
<211> 1623 
<212> DNA 

<213> Artificial Sequence 
<220>
<223> Description of Artificial Sequence: DNA sequence
coding for a fusion protein deltaVP22cre-StrepTag 

<220> 

<221> CDS 

<222> (1)..(1617)

<400> 13 

atg gct agc atg act ggt gga cag caa atg ggt cgg gat ccg tcg acg 48
Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Asp Pro Ser Thr
1 5 10 15

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 96
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
20 25 30

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 144
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
35 40 45

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 192
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
50 55 60

gcg gcc atg cat gcc cgg atg gcg gct gtc cag ctc tgg gac atg tcg 240
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
65 70 75 80

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 288
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
85 90 95

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 336
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
100 105 110

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 384
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
115 120 125

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 432
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
130 135 140

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 480
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
145 150 155 160

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 528
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
165 170 175

agt ggc ggc cgc atg tcc aat tta ctg acc gta cac caa aat ttg cct 576
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
180 185 190

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 624
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
195 200 205

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 672
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
210 215 220

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 720
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
225 230 235 240

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 768
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
245 250 255

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 816
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
260 265 270

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 864
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
275 280 285

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 912
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
290 295 300

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 960
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
305 310 315 320

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1008
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
325 330 335

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1056
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
340 345 350

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1104
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
355 360 365

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1152
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
370 375 380

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1200
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
385 390 395 400

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1248
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
405 410 415

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1296
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
420 425 430

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1344
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
435 440 445

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1392
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
450 455 460

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1440
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
465 470 475 480

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1488
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
485 490 495

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1536
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
500 505 510

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat ggt atc gaa ggt cgt 1584
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg
515 520 525

ggt agc gct tgg cgt cac ccg cag ttc ggt ggt taataa 1623
Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly
530 535

<210> 14 
<211> 539 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein deltaVP22cre-StrepTag 

<400> 14 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Asp Pro Ser Thr
1 5 10 15

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
20 25 30

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
35 40 45

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
50 55 60

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
65 70 75 80

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
85 90 95

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
100 105 110

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
115 120 125

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
130 135 140

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
145 150 155 160

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
165 170 175

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
180 185 190

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
195 200 205



WO 01/49832 PCT/EP01/00060 

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
210 215 220

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
225 230 235 240

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
245 250 255

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
260 265 270

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
275 280 285

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
290 295 300

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
305 310 315 320

Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
325 330 335

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
340 345 350

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
355 360 365

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
370 375 380

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
385 390 395 400

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
405 410 415

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
420 425 430

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
435 440 445

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
450 455 460

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
465 470 475 480

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
485 490 495

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
500 505 510

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg
515 520 525

Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly
530 535

<210> 15
<211> 5953 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCRT7-deltaVPCS 

<400> 15 

cgatggattt ccgtctctgg tgtagctgat gatccgaata actacctgtt ttgccgggtc 60 
agaaaaaatg gtgttgccgc gccatctgcc accagccagc tatcaactcg cgccctggaa 120 
gggatttttg aagcaactca tcgattgatt tacggcgcta aggatgactc tggtcagaga 180 
tacctggcct ggtctggaca cagtgcccgt gtcggagccg cgcgagatat ggcccgcgct 240 
ggagtttcaa taccggagat catgcaagct ggtggctgga ccaatgtaaa tattgtcatg 300 
aactatatcc gtaacctgga tagtgaaaca ggggcaatgg tgcgcctgct ggaagatggc 360 
gatggtatcg aaggtcgtgg tagcgcttgg cgtcacccgc agttcggtgg ttaataagct 420 
tcgaacaaaa actcatctca gaagaggatc tgaatatgca taccggtcat catcaccatc 480 
accattgagt tttgagcaat aactagcata accccttggg gcctctaaac gggtcttgag 540 
gggttttttg ctgaaaggag gaactatatc cggatatcca caggacgggt gtggtcgcca 600 
tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa 660 
agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg 720 
ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc 780 
ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg gtgccgagga 840 
tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact 900 
gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaattaattc 960 
ttagaaaaac tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat 1020 
accatatttt tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca 1080 
taggatggca agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc 1140
tattaatttc ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac 1200 
tgaatccggt gagaatggca aaagcttatg catttctttc cagacttgtt caacaggcca 1260 
gccattacgc tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg 1320 
cgcctgagcg agacgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga 1380 
atgcaaccgg cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata 1440
ttcttctaat acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc 1500 
atcaggagta cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt 1560 
tagtctgacc atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa 1620 
caactctggc gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac 1680 
attatcgcga gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg 1740 
cctcgagcaa gacgtttccc gttgaatatg gctcataaca ccccttgtat tactgtttat 1800 
gtaagcagac agttttattg ttcatgacca aaatccctta acgtgagttt tcgttccact 1860 
gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg 1920 
taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc 1980 
aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata 2040 
ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta 2100 
catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc 2160 
ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg 2220 
ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac 2280 
agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg 2340 
taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt 2400
atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct 2460
cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg 2520 
ccttttgctg gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata 2580 
accgtattac cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca 2640 
gcgagtcagt gagcgaggaa gcggaagagc gcctgatgcg gtattttctc cttacgcatc 2700 
tgtgcggtat ttcacaccgc atatatggtg cactctcagt acaatctgct ctgatgccgc 2760
atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc tgcgccccga 2820
cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 2880 
agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 2940 
aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg 3000 
tctgcctgtt catccgcgtc cagctcgttg agtttctcca gaagcgttaa tgtctggctt 3060
ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcactga tgcctccgtg 3120 
taagggggat ttctgttcat gggggtaatg ataccgatga aacgagagag gatgctcacg 3180 
atacgggtta ctgatgatga acatgcccgg ttactggaac gttgtgaggg taaacaactg 3240
gcggtatgga tgcggcggga ccagagaaaa atcactcagg gtcaatgcca gcgcttcgtt 3300
aatacagatg taggtgttcc acagggtagc cagcagcatc ctgcgatgca gatccggaac 3360 
ataatggtgc agggcgctga cttccgcgtt tccagacttt acgaaacacg gaaaccgaag 3420 
accattcatg ttgttgctca ggtcgcagac gttttgcagc agcagtcgct tcacgttcgc 3480 
tcgcgtatcg gtgattcatt ctgctaacca gtaaggcaac cccgccagcc tagccgggtc 3540
ctcaacgaca ggagcacgat catgcgcacc cgtggccagg acccaacgct gcccgagatg 3600 
cgccgcgtgc ggctgctgga gatggcggac gcgatggata tgttctgcca agggttggtt 3660 
tgcgcattca cagttctccg caagaattga ttggctccaa ttcttggagt ggtgaatccg 3720 
ttagcgaggt gccgccggct tccattcagg tcgaggtggc ccggctccat gcaccgcgac 3780 
gcaacgcggg gaggcagaca aggtataggg cggcgcctac aatccatgcc aacccgttcc 3840
atgtgctcgc cgaggcggca taaatcgccg tgacgatcag cggtccagtg atcgaagtta 3900 
ggctggtaag agccgcgagc gatccttgaa gctgtccctg atggtcgtca tctacctgcc 3960 
tggacagcat ggcctgcaac gcgggcatcc cgatgccgcc ggaagcgaga agaatcataa 4020 
tggggaaggc catccagcct cgcgtcgcga acgccagcaa gacgtagccc agcgcgtcgg 4080 
ccgccatgcc ggcgataatg gcctgcttct cgccgaaacg tttggtggcg ggaccagtga 4140 
cgaaggcttg agcgagggcg tgcaagattc cgaataccgc aagcgacagg ccgatcatcg 4200 
tcgcgctcca gcgaaagcgg tcctcgccga aaatgaccca gagcgctgcc ggcacctgtc 4260 
ctacgagttg catgataaag aagacagtca taagtgcggc gacgatagtc atgccccgcg 4320 
cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga cgctctccct 4380 
tatgcgactc ctgcattagg aagcagccca gtagtaggtt gaggccgttg agcaccgccg 4440 
ccgcaaggaa tggtgcatgc aaggagatgg cgcccaacag tcccccggcc acggggcctg 4500 
ccaccatacc cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc 4560 
catcggtgat gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg 4620
ccacgatgcg tccggcgtag aggatcgaga tctcgatccc gcgaaattaa tacgactcac 4680
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 4740 
tatacatatg gctagcatga ctggtggaca gcaaatgggt cgggatccgt cgacggcgcc 4800 
aacccgatcc aagacacccg cgcaggggct ggccagaaag ctgcacttta gcaccgcccc 4860 
cccaaacccc gacgcgccat ggaccccccg ggtggccggc tttaacaagc gcgtcttctg 4920
cgccgcggtc gggcgcctgg cggccatgca tgcccggatg gcggctgtcc agctctggga 4980
catgtcgcgt ccgcgcacag acgaagacct caacgaactc cttggcatca ccaccatccg 5040 
cgtgacggtc tgcgagggca aaaacctgct tcagcgcgcc aacgagttgg tgaatccaga 5100 
cgtggtgcag gacgtcgacg cggccacggc gactcgaggg cgttctgcgg cgtcgcgccc 5160 
caccgagcga cctcgagccc cagcccgctc cgcttctcgc cccagacggc ccgtcgaggg 5220 
taccgagctc ggatccacta gtccagtgtg gtggaattct gcagatatcc agcacagtgg 5280 
cggccgcatg tccaatttac tgaccgtaca ccaaaatttg cctgcattac cggtcgatgc 5340 
aacgagtgat gaggttcgca agaacctgat ggacatgttc agggatcgcc aggcgttttc 5400 
tgagcatacc tggaaaatgc ttctgtccgt ttgccggtcg tgggcggcat ggtgcaagtt 5460
gaataaccgg aaatggtttc ccgcagaacc tgaagatgtt cgcgattatc ttctatatct 5520 
tcaggcgcgc ggtctggcag taaaaactat ccagcaacat ttgggccagc taaacatgct 5580 
tcatcgtcgg tccgggctgc cacgaccaag tgacagcaat gctgtttcac tggttatgcg 5640 
gcggatccga aaagaaaacg ttgatgccgg tgaacgtgca aaacaggctc tagcgttcga 5700 
acgcactgat ttcgaccagg ttcgttcact catggaaaat agcgatcgct gccaggatat 5760 
acgtaatctg gcatttctgg ggattgctta taacaccctg ttacgtatag ccgaaattgc 5820 
caggatcagg gttaaagata tctcacgtac tgacggtggg agaatgttaa tccatattgg 5880 
cagaacgaaa acgctggtta gcaccgcagg tgtagagaag gcacttagcc tgggggtaac 5940 
taaactggtc gag 5953 



<210> 16 
<211> 4727 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pT7-TACS 

<400> 16 

atccggatat agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa 60 

ggggttatgc tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt 120 

tgttagcagc cggatctcag tggtggtggt ggtggtgctc gagtgcggcc gcaagcttat 180 

taaccaccga actgcgggtg acgccaagcg ctaccacgac cttcgatacc atcgccatct 240

tccagcaggc gcaccattgc ccctgtttca ctatccaggt tacggatata gttcatgaca 300 

atatttacat tggtccagcc accagcttgc atgatctccg gtattgaaac tccagcgcgg 360
gccatatctc gcgcggctcc gacacgggca ctgtgtccag accaggccag gtatctctga 420

ccagagtcat ccttagcgcc gtaaatcaat cgatgagttg cttcaaaaat cccttccagg 480 

gcgcgagttg atagctggct ggtggcagat ggcgcggcaa caccattttt tctgacccgg 540 

caaaacaggt agttattcgg atcatcagct acaccagaga cggaaatcca tcgctcgacc 600 

agtttagtta cccccaggct aagtgccttc tctacacctg cggtgctaac cagcgttttc 660 

gttctgccaa tatggattaa cattctccca ccgtcagtac gtgagatatc tttaaccctg 720 

atcctggcaa tttcggctat acgtaacagg gtgttataag caatccccag aaatgccaga 780 

ttacgtatat cctggcagcg atcgctattt tccatgagtg aacgaacctg gtcgaaatca 840 

gtgcgttcga acgctagagc ctgttttgca cgttcaccgg catcaacgtt ttcttttcgg 900 

atccgccgca taaccagtga aacagcattg ctgtcacttg gtcgtggcag cccggaccga 960 

cgatgaagca tgtttagctg gcccaaatgt tgctggatag tttttactgt cagaccgcgc 1020 

gcctgaagat atagaagata atcgcgaaca tcttcaggtt ctgcgggaaa ccatttccgg 1080 

ttattcaact tgcaccatgc cgcccacgac cggcaaacgg acagaagcat tttccaggta 1140 

tgctcagaaa acgcctggcg atccctgaac atgtccatca ggttcttgcg aacctcatca 1200 

ctcgttgcat cgaccggtaa tgcaggcaaa ttttggtgta cggtcagtaa attggacatg 1260 

ccgcggcggc gttggcggcg cttcttgcgg ccgtagccca tggtatatct ccttcttaaa 1320 

gttaaacaaa attatttcta gagggaaacc gttgtggtct ccctatagtg agtcgtatta 1380

atttcgcggg atcgagatct cgggcagcgt tgggtcctgg ccacgggtgc gcatgatcgt 1440 

gctcctgtcg ttgaggaccc ggctaggctg gcggggttgc cttactggtt agcagaatga 1500 

atcaccgata cgcgagcgaa cgtgaagcga ctgctgctgc aaaacgtctg cgacctgagc 1560 

aacaacatga atggtcttcg gtttccgtgt ttcgtaaagt ctggaaacgc ggaagtcagc 1620 

gccctgcacc attatgttcc ggatctgcat cgcaggatgc tgctggctac cctgtggaac 1680 

acctacatct gtattaacga agcgctggca ttgaccctga gtgatttttc tctggtcccg 1740 

ccgcatccat accgccagtt gtttaccctc acaacgttcc agtaaccggg catgttcatc 1800 

atcagtaacc cgtatcgtga gcatcctctc tcgtttcatc ggtatcatta cccccatgaa 1860 

cagaaatccc ccttacacgg aggcatcagt gaccaaacag gaaaaaaccg cccttaacat 1920 

ggcccgcttt atcagaagcc agacattaac gcttctggag aaactcaacg agctggacgc 1980 

ggatgaacag gcagacatct gtgaatcgct tcacgaccac gctgatgagc tttaccgcag 2040 

ctgcctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 2100 

ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc 2160 

gggtgttggc gggtgtcggg gcgcagccat gacccagtca cgtagcgata gcggagtgta 2220 

tactggctta actatgcggc atcagagcag attgtactga gagtgcacca tatatgcggt 2280 

gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgctct tccgcttcct 2340 

cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 2400 

aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 2460 

aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 2520 

tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 2580 

caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 2640 

cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 2700 

ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 2760 

gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 2820 

agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 2880 

gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 2940 

acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 3000 

gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 3060 

gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 3120 

cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 3180 

caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 3240 

gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 3300 

cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 3360 

cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 3420 

caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 3480 

gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 3540 

gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctgcaggc atcgtggtgt 3600 

cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 3660 

catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 3720 

gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 3780 

ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 3840 

gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 3900 

cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 3960 

tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 4020 

gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 4080

atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 4140

ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 4200

gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 4260

aaattgtaaa cgttaatatt ttgttaaaat tcgcgttaaa tttttgttaa atcagctcat 4320

tttttaacca ataggccgaa atcggcaaaa tcccttataa atcaaaagaa tagaccgaga 4380

tagggttgag tgttgttcca gtttggaaca agagtccact attaaagaac gtggactcca 4440

acgtcaaagg gcgaaaaacc gtctatcagg gcgatggccc actacgtgaa ccatcaccct 4500

aatcaagttt tttggggtcg aggtgccgta aagcactaaa tcggaaccct aaagggagcc 4560

cccgatttag agcttgacgg ggaaagccgg cgaacgtggc gagaaaggaa gggaagaaag 4620

cgaaaggagc gggcgctagg gcgctggcaa gtgtagcggt cacgctgcgc gtaaccacca 4680

cacccgccgc gcttaatgcg ccgctacagg gcgcgtccca ttcgcca 4727



<210> 17 
<211> 4488 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pT7-VPCS 

<400> 17 

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 60 
gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 120 
gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 180 
cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 240 
gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 300 
gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 360 
ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 420 
tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 480 
ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 540 
cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 600
accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaaca 660 
cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 720 
tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 780 
cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 840
acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 900
atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga 960
tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga 1020
aaagtgccac ctgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg 1080
cgtatcacga ggccctttcg tcttcaagaa ttaaaaggat ctaggtgaag atcctttttg 1140
ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 1200
tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 1260
aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 1320
tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 1380
agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 1440
taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 1500
caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 1560
agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 1620
aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 1680
gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 1740
tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 1800
gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 1860
ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 1920
ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 1980
aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 2040
accgcatcag atctgatggt gcactctcag tacaatctgc tctgatgccg catagttaag 2100
ccagtatata cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc 2160
caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 2220
ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 2280 
cgaggcccag cgattcgaac ttctgataga cttcgaaatt aatacgactc actataggga 2340 
gaccacaacg gtttccctct agaaataatt ttgtttaact ttaagaagga gatatacata 2400 
tgacctctcg ccgctccgtg aagtcgggtc cgcgggaggt tccgcgcgat gagtacgagg 2460
atctgtacta caccccgtct tcaggtatgg cgagtcccga tagtccgcct gacacctccc 2520
gccgtggcgc cctacagaca cgctcgcgcc agaggggcga ggtccgtttc gtccagtacg 2580
acgagtcgga ttatgccctc tacgggggct cgtcttccga agacgacgaa cacccggagg 2640
tcccccggac gcggcgtccc gtttccgggg cggttttgtc cggcccgggg cctgcgcggg 2700 
cgcctccgcc acccgctggg tccggagggg ccggacgcac acccaccacc gccccccggg 2760 
ccccccgaac ccagcgggtg gcgtctaagg cccccgcggc cccggcggcg gagaccaccc 2820 
gcggcaggaa atcggcccag ccagaatccg ccgcactccc agacgccccc gcgtcgacgg 2880 
cgccaacccg atccaagaca cccgcgcagg ggctggccag aaagctgcac tttagcaccg 2940 
cccccccaaa ccccgacgcg ccatggaccc cccgggtggc cggctttaac aagcgcgtct 3000 
tctgcgccgc ggtcgggcgc ctggcggcca tgcatgcccg gatggcggct gtccagctct 3060 
gggacatgtc gcgtccgcgc acagacgaag acctcaacga actccttggc atcaccacca 3120 
tccgcgtgac ggtctgcgag ggcaaaaacc tgcttcagcg cgccaacgag ttggtgaatc 3180 
cagacgtggt gcaggacgtc gacgcggcca cggcgactcg agggcgttct gcggcgtcgc 3240 
gccccaccga gcgacctcga gccccagccc gctccgcttc tcgccccaga cggcccgtcg 3300 
agggtaccga gctcggatcc actagtccag tgtggtggaa ttctgcagat atccagcaca 3360 
gtggcggccg catgtccaat ttactgaccg tacaccaaaa tttgcctgca ttaccggtcg 3420 
atgcaacgag tgatgaggtt cgcaagaacc tgatggacat gttcagggat cgccaggcgt 3480 
tttctgagca tacctggaaa atgcttctgt ccgtttgccg gtcgtgggcg gcatggtgca 3540 
agttgaataa ccggaaatgg tttcccgcag aacctgaaga tgttcgcgat tatcttctat 3600 
atcttcaggc gcgcggtctg gcagtaaaaa ctatccagca acatttgggc cagctaaaca 3660 
tgcttcatcg tcggtccggg ctgccacgac caagtgacag caatgctgtt tcactggtta 3720 
tgcggcggat ccgaaaagaa aacgttgatg ccggtgaacg tgcaaaacag gctctagcgt 3780 
tcgaacgcac tgatttcgac caggttcgtt cactcatgga aaatagcgat cgctgccagg 3840 
atatacgtaa tctggcattt ctggggattg cttataacac cctgttacgt atagccgaaa 3900 
ttgccaggat cagggttaaa gatatctcac gtactgacgg tgggagaatg ttaatccata 3960 
ttggcagaac gaaaacgctg gttagcaccg caggtgtaga gaaggcactt agcctggggg 4020 
taactaaact ggtcgagcga tggatttccg tctctggtgt agctgatgat ccgaataact 4080 
acctgttttg ccgggtcaga aaaaatggtg ttgccgcgcc atctgccacc agccagctat 4140 
caactcgcgc cctggaaggg atttttgaag caactcatcg attgatttac ggcgctaagg 4200 
atgactctgg tcagagatac ctggcctggt ctggacacag tgcccgtgtc ggagccgcgc 4260 
gagatatggc ccgcgctgga gtttcaatac cggagatcat gcaagctggt ggctggacca 4320 
atgtaaatat tgtcatgaac tatatccgta acctggatag tgaaacaggg gcaatggtgc 4380 
gcctgctgga agatggcgat ggtatcgaag gtcgtggtag cgcttggcgt cacccgcagt 4440 
tcggtggtta ataagcttat cgatgataag ctgtcaaaca tgagaatt 4488 



<210> 18 
<211> 1125 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TATcreStrepTag

<220>

<221> CDS

<222> (1)..(1119)

<400> 18

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc 48
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96
Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala
20 25 30

acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 144
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg
35 40 45

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tcc gtt tgc cgg 192
Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg
50 55 60

tcg tgg gcg gca tgg tgc aag ttg aat aac cgg aaa tgg ttt ccc gca 240
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala
65 70 75 80

gaa cct gaa gat gtt cgc gat tat ctt cta tat ctt cag gcg cgc ggt 288
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly
85 90 95

ctg aca gta aaa act atc cag caa cat ttg ggc cag cta aac atg ctt 336
Leu Thr Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu
100 105 110

cat cgt cgg tcc ggg ctg cca cga cca agt gac agc aat gct gtt tca 384
His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser
115 120 125

ctg gtt atg cgg cgg atc cga aaa gaa aac gtt gat gcc ggt gaa cgt 432
Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg
130 135 140

gca aaa cag gct cta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 480
Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg
145 150 155 160

tca ctc atg gaa aat agc gat cgc tgc cag gat ata cgt aat ctg gca 528
Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala
165 170 175

ttt ctg ggg att gct tat aac acc ctg tta cgt ata gcc gaa att gcc 576
Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala
180 185 190

agg atc agg gtt aaa gat atc tca cgt act gac ggt ggg aga atg tta 624
Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu
195 200 205

atc cat att ggc aga acg aaa acg ctg gtt agc acc gca ggt gta gag 672
Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu
210 215 220

aag gca ctt agc ctg ggg gta act aaa ctg gtc gag cga tgg att tcc 720
Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser
225 230 235 240

gtc tct ggt gta gct gat gat ccg aat aac tac ctg ttt tgc cgg gtc 768
Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val
245 250 255

aga aaa aat ggt gtt gcc gcg cca tct gcc acc agc cag cta tca act 816
Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr
260 265 270

cgc gcc ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864
Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly
275 280 285

gct aag gat gac tct ggt cag aga tac ctg gcc tgg tct gga cac agt 912
Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser
290 295 300

gcc cgt gtc gga gcc gcg cga gat atg gcc cgc gct gga gtt tca ata 960
Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile
305 310 315 320

ccg gag atc atg caa gct ggt ggc tgg acc aat gta aat att gtc atg 1008
Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met
325 330 335

aac tat atc cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056
Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu
340 345 350

ctg gaa gat ggc gat ggt atc gaa ggt cgt ggt agc gct tgg cgt cac 1104
Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg Gly Ser Ala Trp Arg His
355 360 365

ccg cag ttc ggt ggt taataa 1125
Pro Gln Phe Gly Gly
370

<210> 19 
<211> 373 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TATcreStrepTag 

<400> 19 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser
1 5 10 15

Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala
20 25 30

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg
35 40 45

Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg
50 55 60

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala
65 70 75 80

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly
85 90 95

Leu Thr Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu
100 105 110

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser
115 120 125

Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg
130 135 140

Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg
145 150 155 160

Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala
165 170 175

Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala
180 185 190

Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu
195 200 205

Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu
210 215 220

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser
225 230 235 240

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val
245 250 255

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr
260 265 270

Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly
275 280 285

Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser
290 295 300

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile
305 310 315 320

Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met
325 330 335

Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu
340 345 350

Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg Gly Ser Ala Trp Arg His
355 360 365

Pro Gln Phe Gly Gly
370

<210> 20 
<211> 2055 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22creStrepTag

<220>

<221> CDS

<222> (1)..(2049)

<400> 20

atg acc tct cgc cgc tcc gtg aag tcg ggt ccg cgg gag gtt ccg cgc 48
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

gat gag tac gag gat ctg tac tac acc ccg tct tca ggt atg gcg agt 96
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

ccc gat agt ccg cct gac acc tcc cgc cgt ggc gcc cta cag aca cgc 144
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

tcg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag tcg gat 192
Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

tat gcc ctc tac ggg ggc tcg tct tcc gaa gac gac gaa cac ccg gag 240
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

gtc ccc cgg acg cgg cgt ccc gtt tcc ggg gcg gtt ttg tcc ggc ccg 288
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

ggg cct gcg cgg gcg cct ccg cca ccc gct ggg tcc gga ggg gcc gga 336
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

cgc aca ccc acc acc gcc ccc cgg gcc ccc cga acc cag cgg gtg gcg 384
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

tct aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432
Ser Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

tcg gcc cag cca gaa tcc gcc gca ctc cca gac gcc ccc gcg tcg acg 480
Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 576
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

gcg gcc atg cat gcc cgg atg gcg gct gtc cag ctc tgg gac atg tcg 672
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 720
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 768
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 864
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 912
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 960
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

agt ggc ggc cgc atg tcc aat tta ctg acc gta cac caa aat ttg cct 1008
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
325 330 335

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
340 345 350

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
355 360 365

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 1152
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
370 375 380

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 1200
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
385 390 395 400

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 1248
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
405 410 415

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 1296
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
420 425 430

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 1344
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
435 440 445

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 1392
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
450 455 460

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1440
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
465 470 475 480

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1488
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
485 490 495

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1536
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
500 505 510

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1584
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
515 520 525

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1632
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
530 535 540

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1680
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
545 550 555 560

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1728
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
565 570 575

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1776
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
580 585 590

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1824
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
595 600 605

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1872
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
610 615 620

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1920
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
625 630 635 640

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1968
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
645 650 655

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat ggt atc gaa ggt cgt 2016
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg
660 665 670

ggt agc gct tgg cgt cac ccg cag ttc ggt ggt taataa 2055
Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly
675 680



<210> 21 
<211> 683 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22creStrepTag 

<400> 21 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

Ser Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
325 330 335

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
340 345 350

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
355 360 365

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
370 375 380

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
385 390 395 400

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
405 410 415

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
420 425 430

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
435 440 445

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
450 455 460

Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
465 470 475 480

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
485 490 495

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
500 505 510

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
515 520 525

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
530 535 540

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
545 550 555 560

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
565 570 575

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
580 585 590

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
595 600 605

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
610 615 620

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
625 630 635 640

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
645 650 655

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg
660 665 670

Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly
675 680



<210> 22 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 22 

Ala Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10



<210> 23
<211> 11
<212> PRT

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: synthetic TAT
protein

<400> 23

Tyr Ala Arg Lys Ala Arg Arg Gln Ala Arg Arg
1 5 10



<210> 24 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 24 

Tyr Ala Arg Ala Ala Ala Arg Gln Ala Arg Ala
1 5 10



<210> 25 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 25 

Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Arg 
1 5 10



<210> 26 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 26 

Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Ala 
1 5 10



<210> 27 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 27 

Tyr Ala Arg Arg Arg Arg Arg Arg Arg Arg Arg 
1 5 10



<210> 28 
<211> 11 
<212> PRT

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 28 

Tyr Ala Ala Ala Arg Arg Arg Arg Arg Arg Arg 
1 5 10 



<210> 29 
<211> 4960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector
pCMV-I-Cre-pA



<400> 29 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660
attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720
tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780
cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840
ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900
gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960
cagggcggcc tcgaccatgc ccaagaagaa gaggaaggtg tccaatttac tgaccgtaca 1020
ccaaaatttg cctgcattac cggtcgatgc aacgagtgat gaggttcgca agaacctgat 1080
ggacatgttc agggatcgcc aggcgttttc tgagcatacc tggaaaatgc ttctgtccgt 1140
ttgccggtcg tgggcggcat ggtgcaagtt gaataaccgg aaatggtttc ccgcagaacc 1200
tgaagatgtt cgcgattatc ttctatatct tcaggcgcgc ggtctggcag taaaaactat 1260
ccagcaacat ttgggccagc taaacatgct tcatcgtcgg tccgggctgc cacgaccaag 1320
tgacagcaat gctgtttcac tggttatgcg gcggatccga aaagaaaacg ttgatgccgg 1380
tgaacgtgca aaacaggctc tagcgttcga acgcactgat ttcgaccagg ttcgttcact 1440
catggaaaat agcgatcgct gccaggatat acgtaatctg gcatttctgg ggattgctta 1500
taacaccctg ttacgtatag ccgaaattgc caggatcagg gttaaagata tctcacgtac 1560
tgacggtggg agaatgttaa tccatattgg cagaacgaaa acgctggtta gcaccgcagg 1620
tgtagagaag gcacttagcc tgggggtaac taaactggtc gagcgatgga tttccgtctc 1680
tggtgtagct gatgatccga ataactacct gttttgccgg gtcagaaaaa atggtgttgc 1740
cgcgccatct gccaccagcc agctatcaac tcgcgccctg gaagggattt ttgaagcaac 1800
tcatcgattg atttacggcg ctaaggatga ctctggtcag agatacctgg cctggtctgg 1860
acacagtgcc cgtgtcggag ccgcgcgaga tatggcccgc gctggagttt caataccgga 1920
gatcatgcaa gctggtggct ggaccaatgt aaatattgtc atgaactata tccgtaacct 1980
ggatagtgaa acaggggcaa tggtgcgcct gctggaagat ggcgattagc cattaacgcg 2040
taaatgattg cagatccact agttctaggg ccgcgtcgac ctcgagatcc aggcgcggat 2100
caataaaaga tcattatttt caatagatct gtgtgttggt tttttgtgtg ccttggggga 2160
gggggaggcc agaatgaggc gcggccaagg gggaggggga ggccagaatg accttggggg 2220
agggggaggc cagaatgacc ttgggggagg gggaggccag aatgaggcgc gcccccgggt 2280
accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 2340
tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 2400
ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat 2460
gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag 2520
tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga 2580
cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 2640
cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg 2700 
cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc 2760
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 2820 
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 2880 
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 2940
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 3000 
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 3060 
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 3120 
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 3180 
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 3240 
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 3300
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 3360 
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 3420 
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 3480
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 3540 
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 3600 
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 3660 
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 3720 
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 3780 
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 3840 
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 3900 
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 3960 
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 4020 
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 4080 
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 4140 
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 4200 
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 4260 
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 4320 
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 4380 
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 4440 
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 4500 
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 4560 
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 4620 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 4680 
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 4740 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 4800 
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 4860 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 4920 
cgccaagcta gcccgggcta gcttgcatgc ctgcaggttt t 4960 



<210> 30 
<211> 7332 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCMV-I-beta-pA 

<400> 30 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 

tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120 

cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180 

atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240 

tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300 

cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360 

tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420 

cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480

ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540 

aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600 

gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660 

attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720
tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780
cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840 
ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900 
gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960 
cagggcggcc gcaattcccg gggatcgaaa gagcctgcta aagcaaaaaa gaagtcacca 1020 
tgtcgtttac tttgaccaac aagaacgtga ttttcgttgc cggtctggga ggcattggtc 1080 
tggacaccag caaggagctg ctcaagcgcg atcccgtcgt tttacaacgt cgtgactggg 1140 
aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc 1200 
gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg 1260 
aatggcgctt tgcctggttt ccggcaccag aagcggtgcc ggaaagctgg ctggagtgcg 1320 
atcttcctga ggccgatact gtcgtcgtcc cctcaaactg gcagatgcac ggttacgatg 1380 
cgcccatcta caccaacgta acctatccca ttacggtcaa tccgccgttt gttcccacgg 1440 
agaatccgac gggttgttac tcgctcacat ttaatgttga tgaaagctgg ctacaggaag 1500 
gccagacgcg aattattttt gatggcgtta actcggcgtt tcatctgtgg tgcaacgggc 1560 
gctgggtcgg ttacggccag gacagtcgtt tgccgtctga atttgacctg agcgcatttt 1620 
tacgcgccgg agaaaaccgc ctcgcggtga tggtgctgcg ttggagtgac ggcagttatc 1680 
tggaagatca ggatatgtgg cggatgagcg gcattttccg tgacgtctcg ttgctgcata 1740 
aaccgactac acaaatcagc gatttccatg ttgccactcg ctttaatgat gatttcagcc 1800 
gcgctgtact ggaggctgaa gttcagatgt gcggcgagtt gcgtgactac ctacgggtaa 1860 
cagtttcttt atggcagggt gaaacgcagg tcgccagcgg caccgcgcct ttcggcggtg 1920 
aaattatcga tgagcgtggt ggttatgccg atcgcgtcac actacgtctg aacgtcgaaa 1980 
acccgaaact gtggagcgcc gaaatcccga atctctatcg tgcggtggtt gaactgcaca 2040 
ccgccgacgg cacgctgatt gaagcagaag cctgcgatgt cggtttccgc gaggtgcgga 2100 
ttgaaaatgg tctgctgctg ctgaacggca agccgttgct gattcgaggc gttaaccgtc 2160 
acgagcatca tcctctgcat ggtcaggtca tggatgagca gacgatggtg caggatatcc 2220 
tgctgatgaa gcagaacaac tttaacgccg tgcgctgttc gcattatccg aaccatccgc 2280 
tgtggtacac gctgtgcgac cgctacggcc tgtatgtggt ggatgaagcc aatattgaaa 2340 
cccacggcat ggtgccaatg aatcgtctga ccgatgatcc gcgctggcta ccggcgatga 2400 
gcgaacgcgt aacgcgaatg gtgcagcgcg atcgtaatca cccgagtgtg atcatctggt 2460 
cgctggggaa tgaatcaggc cacggcgcta atcacgacgc gctgtatcgc tggatcaaat 2520 
ctgtcgatcc ttcccgcccg gtgcagtatg aaggcggcgg agccgacacc acggccaccg 2580 
atattatttg cccgatgtac gcgcgcgtgg atgaagacca gcccttcccg gctgtgccga 2640 
aatggtccat caaaaaatgg ctttcgctac ctggagagac gcgcccgctg atcctttgcg 2700 
aatacgccca cgcgatgggt aacagtcttg gcggtttcgc taaatactgg caggcgtttc 2760
gtcagtatcc ccgtttacag ggcggcttcg tctgggactg ggtggatcag tcgctgatta 2820 
aatatgatga aaacggcaac ccgtggtcgg cttacggcgg tgattttggc gatacgccga 2880 
acgatcgcca gttctgtatg aacggtctgg tctttgccga ccgcacgccg catccagcgc 2940 
tgacggaagc aaaacaccag cagcagtttt tccagttccg tttatccggg caaaccatcg 3000 
aagtgaccag cgaatacctg ttccgtcata gcgataacga gctcctgcac tggatggtgg 3060 
cgctggatgg taagccgctg gcaagcggtg aagtgcctct ggatgtcgct ccacaaggta 3120 
aacagttgat tgaactgcct gaactaccgc agccggagag cgccgggcaa ctctggctca 3180 
cagtacgcgt agtgcaaccg aacgcgaccg catggtcaga agccgggcac atcagcgcct 3240 
ggcagcagtg gcgtctggcg gaaaacctca gtgtgacgct ccccgccgcg tcccacgcca 3300 
tcccgcatct gaccaccagc gaaatggatt tttgcatcga gctgggtaat aagcgttggc 3360 
aatttaaccg ccagtcaggc tttctttcac agatgtggat tggcgataaa aaacaactgc 3420 
tgacgccgct gcgcgatcag ttcacccgtg caccgctgga taacgacatt ggcgtaagtg 3480 
aagcgacccg cattgaccct aacgcctggg tcgaacgctg gaaggcggcg ggccattacc 3540 
aggccgaagc agcgttgttg cagtgcacgg cagatacact tgctgatgcg gtgctgatta 3600 
cgaccgctca cgcgtggcag catcagggga aaaccttatt tatcagccgg aaaacctacc 3660 
ggattgatgg tagtggtcaa atggcgatta ccgttgatgt tgaagtggcg agcgatacac 3720 
cgcatccggc gcggattggc ctgaactgcc agctggcgca ggtagcagag cgggtaaact 3780 
ggctcggatt agggccgcaa gaaaactatc ccgaccgcct tactgccgcc tgttttgacc 3840 
gctgggatct gccattgtca gacatgtata ccccgtacgt cttcccgagc gaaaacggtc 3900 
tgcgctgcgg gacgcgcgaa ttgaattatg gcccacacca gtggcgcggc gacttccagt 3960 
tcaacatcag ccgctacagt caacagcaac tgatggaaac cagccatcgc catctgctgc 4020 
acgcggaaga aggcacatgg ctgaatatcg acggtttcca tatggggatt ggtggcgacg 4080 
actcctggag cccgtcagta tcggcggaat tacagctgag cgccggtcgc taccattacc 4140 
agttggtctg gtgtcaaaaa taataataac cgggcaggcc atgtctgccc gtatttcgcg 4200 
taaggaaatc cattatgtac tatttaaaaa acacaaactt ttggatgttc ggtttattct 4260 
ttttctttta cttttttatc atgggagcct acttcccgtt tttcccgatt tggctacatg 4320 
acatcaacca tatcagcaaa agtgatacgg gtattatttt tgccgctatt tctctgttct 4380
cgctattatt ccaaccgctg tttggtctgc tttctgacaa actcggcctc gactctaggc 4440
ggccgcgtcg acctcgagat ccaggcgcgg atcaataaaa gatcattatt ttcaatagat 4500 
ctgtgtgttg gttttttgtg tgccttgggg gagggggagg ccagaatgag gcgcggccaa 4560
gggggagggg gaggccagaa tgaccttggg ggagggggag gccagaatga ccttggggga 4620
gggggaggcc agaatgaggc gcgcccccgg gtaccgagct cgaattcact ggccgtcgtt 4680
ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat 4740
ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag 4800
ttgcgcagcc tgaatggcga atggcgcctg atgcggtatt ttctccttac gcatctgtgc 4860
ggtatttcac accgcatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta 4920
agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg 4980
gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca 5040
ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt 5100
aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc 5160
ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa 5220
taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc 5280
cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa 5340
acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa 5400
ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg 5460
atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa 5520
gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc 5580
acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc 5640
atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta 5700
accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 5760
ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca 5820
acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata 5880
gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc 5940
tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca 6000
ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca 6060
actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg 6120
taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa 6180
tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt 6240
gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat 6300
cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg 6360
gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga 6420
gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac 6480
tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt 6540
ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag 6600
cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc 6660
gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag 6720
gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca 6780
gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt 6840
cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc 6900
tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc 6960
cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc 7020
cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa 7080
ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 7140
tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc 7200
caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa 7260
tttcacacag gaaacagcta tgaccatgat tacgccaagc tagcccgggc tagcttgcat 7320
gcctgcaggt tt 7332



<210> 31 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 31 

atgccatggg ctacggccgc aagaagcgcc gccaacgccg ccgcggcatg tccaatttac 60 
tgaccgtaca cc 72 
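
The primer of SEQ ID NO: 31 can be checked against the TAT sequence YGRKKRRQRRR (SEQ ID NO: 10, as recited in claim 5) with a short translation routine. The following is only an illustrative sketch and not part of the sequence listing: the function names `translate` and `frames_encoding` are hypothetical, and the standard genetic code is assumed.

```python
# Illustrative sketch only; helper names are hypothetical, not from the application.
# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# SEQ ID NO: 31 as printed in the listing (72 nt), with the spacing removed.
PRIMER_31 = (
    "atgccatggg ctacggccgc aagaagcgcc gccaacgccg ccgcggcatg "
    "tccaatttac tgaccgtaca cc"
).replace(" ", "")

# SEQ ID NO: 10 in one-letter code, as recited in claim 5.
TAT_PTD = "YGRKKRRQRRR"


def translate(dna: str, start: int) -> str:
    """Translate complete codons of `dna` beginning at offset `start`."""
    codons = (dna[i:i + 3] for i in range(start, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


def frames_encoding(dna: str, peptide: str) -> list[int]:
    """Return the forward reading frames (0, 1, 2) whose translation contains `peptide`."""
    return [frame for frame in range(3) if peptide in translate(dna, frame)]


if __name__ == "__main__":
    for frame in frames_encoding(PRIMER_31, TAT_PTD):
        print(frame, translate(PRIMER_31, frame))
```

Under these assumptions only one forward frame contains the TAT sequence, and in that frame the TAT residues are followed by a glycine and then by the first codons of the recombinase coding sequence (Met Ser Asn Leu Leu Thr Val His).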



<210> 32 
<211> 25
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 32 

tttcggatcc gccgcataac cagtg 

<210> 33 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 33 

tatatctaga ccatgggcta cggccgcaag aagc 

<210> 34 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 34 

gctaccacga ccttcgatac catcgccatc ttccagcagg cgc 

<210> 35 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

taactagcgg ccgcatgtcc aatttactga ccgtacac 

<210> 36 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

tcgagcggcc gccatcgcca tcttccagca ggcg 

<210> 37 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: primer
<400> 37

tatatctaga catatgacct ctcgccgctc cg 32

<210> 38 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

ttccgaagac gacgaaacac c 21 

<210> 39 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 39 

tatattcgaa gcttattaac caccgaactg cg 32



<210> 40
<211> 4847 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pGK-cre-pA 

<400> 40 

aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 60 
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 120 
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 180
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 240
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 300
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 360
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 420
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 480
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 540
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 600
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 660
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 720
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 780
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 840
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 900
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 960
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 1020
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 1080
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 1140
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 1200
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 1260
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 1320
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 1380
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 1440
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 1500
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 1560 
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 1620 
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 1680 
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 1740 
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 1800 
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 1860 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 1920 
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 1980 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 2040 
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 2100 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 2160 
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctgggtaccg ggccccccct 2220 
cgaggtcgac ggtatcgata agcttgatat cgaattctac cgggtagggg aggcgctttt 2280 
cccaaggcag tctggagcat gcgctttagc agccccgctg gcacttggcg ctacacaagt 2340 
ggcctctggc ctcgcacaca ttccacatcc accggtagcg ccaaccggct ccgttctttg 2400 
gtggcccctt cgcgccactt ctactcctcc cctagtcagg aagtttcccc cagcaagctc 2460
gcgtcgtgca ggacgtgaca aatggaagta gcacgtctca ctagtctcgt gcagatggac 2520 
agcaccgctg agcaatggaa gcgggtaggc ctttggggca gcggccaata gcagctttgt 2580 
tccttcgctt tctgggctca gaggctggga aggggtgggt ccgggggcgg gctcaggggc 2640 
gggctcaggg gcgggcgggc gcccgaaggt cctcccgagg cccggcattc tgcacgcttc 2700 
aaaagcgcac gtctgccgcg ctgttctcct cttcctcatc tccgggcctt tcgacctgca 2760 
gctcgaggtc gaccatgccc aagaagaaga ggaaggtgtc caatttactg accgtacacc 2820 
aaaatttgcc tgcattaccg gtcgatgcaa cgagtgatga ggttcgcaag aacctgatgg 2880 
acatgttcag ggatcgccag gcgttttctg agcatacctg gaaaatgctt ctgtccgttt 2940 
gccggtcgtg ggcggcatgg tgcaagttga ataaccggaa atggtttccc gcagaacctg 3000 
aagatgttcg cgattatctt ctatatcttc aggcgcgcgg tctggcagta aaaactatcc 3060 
agcaacattt gggccagcta aacatgcttc atcgtcggtc cgggctgcca cgaccaagtg 3120 
acagcaatgc tgtttcactg gttatgcggc ggatccgaaa agaaaacgtt gatgccggtg 3180 
aacgtgcaaa acaggctcta gcgttcgaac gcactgattt cgaccaggtt cgttcactca 3240 
tggaaaatag cgatcgctgc caggatatac gtaatctggc atttctgggg attgcttata 3300 
acaccctgtt acgtatagcc gaaattgcca ggatcagggt taaagatatc tcacgtactg 3360 
acggtgggag aatgttaatc catattggca gaacgaaaac gctggttagc accgcaggtg 3420 
tagagaaggc acttagcctg ggggtaacta aactggtcga gcgatggatt tccgtctctg 3480 
gtgtagctga tgatccgaat aactacctgt tttgccgggt cagaaaaaat ggtgttgccg 3540 
cgccatctgc caccagccag ctatcaactc gcgccctgga agggattttt gaagcaactc 3600 
atcgattgat ttacggcgct aaggatgact ctggtcagag atacctggcc tggtctggac 3660 
acagtgcccg tgtcggagcc gcgcgagata tggcccgcgc tggagtttca ataccggaga 3720 
tcatgcaagc tggtggctgg accaatgtaa atattgtcat gaactatatc cgtaacctgg 3780 
atagtgaaac aggggcaatg gtgcgcctgc tggaagatgg cgattagcca ttaacgcgta 3840 
aatgattgca gatccactag ttctagagct cgctgatcag cctcgactgt gccttctagt 3900 
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 3960 
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4020 
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4080 
aggcatgctg gggatgcggt gggctctatg gcttctgagn nngaaagaac cagctggggc 4140 
tcgagatcca ctagttctag cctcgaggct agagcggccg ccaccgcggt ggagctccaa 4200 
ttcgccctat agtgagtcgt attacgcgcg ctcactggcc gtcgttttac aacgtcgtga 4260 
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 4320 
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 4380
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 4440
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 4500
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 4560
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 4620
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 4680
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 4740 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 4800 
acaaaaattt aacgcgaatt ttaacaaaat attaacgctt acaattt 4847

<210> 41 
<211> 22 
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer 
<400> 41 

catctccggg cctttcgacc tg 

<210> 42 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

gcgatcggtg cgggcctctt c

Claims

1. Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in an endogenous gene. 

2. The use of claim 1, wherein the PTD is not derived from Antennapedia 
and preferably is a PTD derived from the VP22 protein of HSV or from the 
TAT protein of HIV. 

3. Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) being not derived from 
Antennapedia and preferably being derived from the VP22 protein of HSV 
or from the TAT protein of HIV 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in its genome. 

4. The use of claim 3, wherein the recognition sites for said site-specific
recombinase are present within an endogenous gene or a transgene.

5. The use of any one of claims 2 to 4, wherein the TAT protein comprises 

(i) the amino acid sequence YGRKKRRQRRR (SEQ ID NO: 10) or a mutant 
thereof including 

(ii) peptides having the amino acid sequences
AGRKKRRQRRR (SEQ ID NO: 22)
YARKARRQARR (SEQ ID NO: 23)
YARAAARQARA (SEQ ID NO: 24)
YARAARRAARR (SEQ ID NO: 25)
YARAARRAARA (SEQ ID NO: 26)
YARRRRRRRRR (SEQ ID NO: 27)
YAAARRRRRRR (SEQ ID NO: 28);

preferably the TAT protein consists of one of the sequences shown in (i) or 
(ii) above. 

6. The use of any one of claims 2 to 4, wherein the VP22 protein
comprises amino acids 16 to 157 of SEQ ID NO: 14.

7. The use of any one of claims 1 to 6, wherein the site-specific DNA 
recombinase domain is selected from a recombinase protein derived from 
Cre, Flp, φC31 recombinase, and R recombinase and preferably is Cre
having amino acids 15 to 357 of SEQ ID NO: 2 or Flpe having amino acids 
15 to 437 of SEQ ID NO: 4. 

8. The use of any one of claims 1 to 7, wherein the protein transduction 
domain is fused to the N-terminus of the site-specific DNA recombinase
domain. 

9. The use of any one of claims 1 to 8, wherein the protein transduction 
domain is fused to the site-specific DNA recombinase domain through a 
direct chemical bond or through a linker molecule. 

10. The use of claim 9, wherein the linker molecule is a short
peptide having 1 to 20, preferably 1 to 10 amino acid residues.

11. The use of any one of claims 1 to 10, wherein said fusion protein
further comprises additional functional sequences. 

12. The use of claim 1, wherein the fusion protein has the sequence 
shown in SEQ ID NOs: 2, 4, 6 or 8. 

13. The use of any one of claims 1 to 12, wherein the living organism is a 
vertebrate, preferably a rodent or a fish. 

14. A method for inducing gene alterations in a living organism which 
comprises administering to said living organism, a fusion protein 
comprising a site-specific DNA recombinase domain and a protein 
transduction domain as defined in claims 1 to 12, wherein said living 
organism carries at least one or more recognition sites for said site- 
specific DNA recombinase integrated in its genome. 

15. A fusion protein comprising 

(a) a site-specific DNA recombinase domain as defined in claims 2 to 9 
and 

(b) a protein transduction domain (PTD) as defined in claims 2 to 9 
provided that when (a) is the wild-type Flp or Cre then (b) is not the full 
length VP22 protein of HSV. 

16. The fusion protein of claim 15, wherein the PTD is derived from the TAT
protein of HIV. 

17. A DNA sequence coding for the fusion protein of claim 15 or 16, said 
DNA sequence preferably comprising the sequence shown in SEQ ID 
NOs: 1, 3, 5, 7, 9, 11, 13, 18 and/or 20.



18. A vector comprising the DNA sequence of claim 17. 



WO 01/49832 






PCT/EP01/00060 



19. A host cell transformed with the vector of claim 18 and/or comprising 
the DNA of claim 17.

20. A method for producing the fusion protein of claim 15 which comprises 
culturing the transformed host cell of claim 19 and isolating the fusion 
protein. 

21. An injectable composition comprising the fusion protein as defined in 
claims 1 to 12 or 15 to 16.

Fig. 1

Fig. 5

Fig. 7

Fig. 8

Fig. 10